14834

Posts

Nov, 8

2nd International Conference on Robotics and Vision (ICRV), 2016

Submission Date: Before February 10, 2016 Publication: All papers will be published in the conference Proceedings and indexed by Scopus, Ei Compendex, Web of Science (CPCI) and etc. Supported by: Hosei University, Japan University of Toyama, Japan North Carolina Agricultural and Technical University, USA Fort Hays State University, USA Nanyang Technological University, Singapore Keynote Speakers: […]
Nov, 8

2nd International Conference on Virtual Reality (ICVR), 2016

Submission Deadline: January 25, 2016 Publication: International Journal of Computer Theory and Engineering (IJCTE) Abstracting/Indexing: Electronic Journals Library, EBSCO, Engineering & Technology Digital Library, Google Scholar, INSPEC, Ulrich’s Periodicals Directory, Crossref, ProQuest, WorldCat, and EI (INSPEC, IET).ET). Keynote Speaker: Prof. Charles M. Falco University of Arizona, USA Conference Chairs: Prof. Charles M. Falco, University of […]
Nov, 8

The 8th International Conference on Digital Image Processing (ICDIP), 2016

******History**** ICDIP conferences have been held annually in Los Angeles (USA), Athens (Greece), Beijing (China), Kuala Lumpur (Malaysia),Chengdu (China), Singapore, and Bangkok (Thailand) Since 2009. So far the previous conference proceedings have been indexed by Ei Compendex and Scopus successfully! Sponsorship ICDIP 2016 is sponsored by Sichuan Province Computer Federation! Publication Accepted (Registered and Presented) […]
Nov, 8

5th International Conference on Software and Information Engineering (ICSIE), 2016

Submission Date: Before February 10, 2016 Publication: Journal of Software (JSW, ISSN: 1796-217X) Abstracting/Indexing: EBSCO, DOAJ, ProQuest, INSPEC, ULRICH’s Periodicals Directory, WorldCat, CNKI,etc. Lecture Notes on Information Theory (LNIT, ISSN: 2301-3788) Abstracting/Indexing: EI (INSPEC, IET), Ulrich’s Periodicals Directory, Google Scholar, EBSCO, Engineering & Technology Digital Library and etc. Supported by: Hosei University, Japan University of […]
Nov, 8

Free Launch: Optimizing GPU Dynamic Kernel Launches through Thread Reuse

Supporting dynamic parallelism is important for GPU to benefit a broad range of applications. There are currently two fundamental ways for programs to exploit dynamic parallelism on GPU: a software-based approach with software-managed worklists, and a hardware-based approach through dynamic subkernel launches. Neither is satisfactory. The former is complicated to program and is often subject […]
Nov, 8

Deep Learning for Computer Vision: A comparison between Convolutional Neural Networks and Hierarchical Temporal Memories on object recognition tasks

In recent years, Deep Learning techniques have shown to perform well on a large variety of problems both in Computer Vision and Natural Language Processing, reaching and often surpassing the state of the art on many tasks [1] [2] [3]. The rise of deep learning is also revolutionizing the entire field of Machine Learning and […]
Nov, 8

High Level Synthesis and Evaluation of the Secure Hash Standard for FPGAs

Secure hash algorithms (SHAs) are important components of cryptographic applications. SHA performance on central processing units (CPUs) is slow, therefore, acceleration must be done using hardware such as Field Programmable Gate Arrays (FPGAs). Considerable work has been done in academia using FPGAs to accelerate SHAs. These designs were implemented using Hardware Description Language (HDL) based […]
Nov, 8

On the Effectiveness of OpenMP teams for Programming Embedded Manycore Accelerators

With the introduction of more powerful and massively parallel embedded processors, embedded systems are becoming HPC capable. In particular heterogeneous on-chip systems (SoC) that couple a general-purpose host processor to a many-core accelerator are becoming more and more widespread, and provide tremendous peak performance/watt, well suited to execute HPC-class programs. The increased computation potential is […]
Nov, 8

Data Stream Classification using Random Feature Functions and Novel Method Combinations

Big Data streams are being generated in a faster, bigger, and more commonplace. In this scenario, Hoeffding Trees are an established method for classification. Several extensions exist, including high-performing ensemble setups such as online and leveraging bagging. Also, $k$-nearest neighbors is a popular choice, with most extensions dealing with the inherent performance limitations over a […]
Nov, 4

Efficient Sparse Matrix-Vector Multiplication on GPUs using the CSR Storage Format

The performance of sparse matrix vector multiplication (SpMV) is important to computational scientists. Compressed sparse row (CSR) is the most frequently used format to store sparse matrices. However, CSR-based SpMV on graphics processing units (GPUs) has poor performance due to irregular memory access patterns, load imbalance, and reduced parallelism. This has led researchers to propose […]
Nov, 4

Performance of GTX Titan X GPUs and Code Optimization

Recently Nvidia has released a new GPU model: GTX Titan X (TX) in a linage of the Maxwell architecture. We use our conjugate gradient code and non-perturbative renormalization code to measure the performance of TX. The results are compared with those of GTX Titan Black (TB) in a lineage of the Kepler architecture. We observe […]
Nov, 4

Accelerating Twisted Mass LQCD with QPhiX

We present the implementation of twisted mass fermion operators for the QPhiX library. We analyze the performance on the Intel Xeon Phi (Knights Corner) coprocessor as well as on Intel Xeon Haswell CPUs. In particular, we demonstrate that on the Xeon Phi 7120P the Dslash kernel is able to reach 80% of the theoretical peak […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: