high performance computing on graphics processing units: hgpu.org

Posts

Aug, 3

2nd International Conference on Mechatronics and Robotics Engineering, 2016

Submission Method: Please log in the Electronic Submission System; and submit your paper; http://www.easychair.org/conferences/?conf=icmre2016 Call for Papers: Robotics and Mechanical Engineering Actuator design, robotic mechanisms and design, robot kinematics and dynamics Agile Manufacturing Agriculture, construction, industrial automation, manufacturing process Automation and control systems, middleware Biomedical and rehabilitation engineering, welfare robotics and mechatronics Cellular Manufacturing Concurrent […]

Aug, 3

4th International Conference on Electrical Energy and Networks (ICEEN), 2016

Publication: Submitted papers can be selected and published into one of the following Journals: * Journal of Advances in Computer Networks (JACN) (ISSN: 1793-8244) EI (INSPEC, IET), Engineering & Technology Digital Library, DOAJ, Electronic Journals Library, Ulrich’s Periodicals Directory, International Computer Science Digital Library (ICSDL), ProQuest, and Google Scholar. * International Journal of Electrical Energy […]

Aug, 3

4th International Conference on System Modeling and Optimization (ICSMO), 2016

Publication: Selected papers of ICSMO 2016 will be published in the International Journal of Modeling and Optimization (ISSN:2010-3697 www.ijmo.org), which will be indexed by Engineering & Technology Digital Library, ProQuest, Crossref, Electronic Journals Library,DOAJ, Google Scholar, EI (INSPEC, IET). Submission Method: Please log in the Electronic Submission System; and submit your paper; http://www.easychair.org/conferences/?conf=icsmo2016 Call for […]

Aug, 1

Parallel Surface Reconstruction on GPU

Marching Cubes is the most frequently used method to reconstruct isosurface from a point cloud. However, the point clouds are getting denser and denser, thus the efficiency of Marching cubes method has become an obstacle. This paper presents a novel GPU-based parallel surface reconstruction algorithm. The algorithm firstly creates a GPU-based uniform grid structure to […]

CUDA

Aug, 1

A University-Industry Collaboration Case Study: Intel Real-Time Multi-View Face Detection Capstone Design Projects

Since 2011, University of Michigan-Shanghai Jiao Tong University Joint Institute (JI) has established 122 corporate-sponsored Capstone Design Projects (CDPs) with world leading companies such as Covidien, General Electric, Hewlett Packard, Intel, and Siemens. Of these corporations, Intel was the first sponsor, having funded 21 projects and mentored 105 students over four consecutive years. This paper […]

OpenCL

Aug, 1

Investigating SRAM PUFs in large CPUs and GPUs

Physically unclonable functions (PUFs) provide data that can be used for cryptographic purposes: on the one hand randomness for the initialization of random-number generators; on the other hand individual fingerprints for unique identification of specific hardware components. However, today’s off-the-shelf personal computers advertise randomness and individual fingerprints only in the form of additional or dedicated […]

CUDA

Jul, 31

8th International Conference on Machine Learning and Computing (ICMLC), 2016

Paper Publication: Paper accepted by ICMLC 2016 will be published into the conference proceeding, which will be included in Elsevier data base. Submission Methods Log in Electronic Submission System (.pdf): http://www.easychair.org/conferences/?conf=icmlc2016 Call for Papers: Adaptive systems Neural net and support vector machine Business intelligence Hybrid and nonlinear system Biometrics Fuzzy set theory, fuzzy control and […]

Jul, 31

3rd International Conference on Advances in Electronics Engineering (ICAEE), 2016

Paper Publication: The paper accepted by ICAEE 2016 will be published in one of the following Journals: *International Journal of Electronics and Electrical Engineering (ISSN: 2301-380X) Abstracting/Indexing: Ulrich’s Periodicals Directory, Google Scholar, EBSCO, Engineering & Technology Digital Library, etc. *International Journal of Information and Electronics Engineering (ISSN: 2010-3719) Abstracting/Indexing: Google Scholar, Electronic Journals Library, Engineering […]

Jul, 31

4th International Conference on Information and Computer Networks (ICICN), 2016

Paper Publication: * Journal of Advances in Computer Networks (ISSN: 1793-8244) Abstracting/ Indexing: EI (INSPEC, IET), Engineering & Technology Digital Library, DOAJ, Electronic Journals Library, Ulrich’s Periodicals Directory, International Computer Science Digital Library (ICSDL), ProQuest, and Google Scholar. * Journal of Advances in Information Technology (ISSN: 1798-2340) Abstracting/Indexing: INSPEC; EBSCO; ULRICH’s Periodicals Directory; WorldCat; CrossRef; […]

Jul, 31

International Conference on Communication and Information Processing (ICCIP), 2015

Topics: • Digital Information Processing and Communications • Access Controls • Anti-cyberterrorism • Assurance of Service • Biometrics Technologies • Cloud Computing • Computational Intelligence • Computer Crime Prevention and Detection • Computer Forensics • Computer Security • Confidentiality Protection • Critical Infrastructure Management • Data Compression • Data Management in Mobile Peer-to-Peer Networks • […]

Jul, 30

GHOST: Building blocks for high performance sparse linear algebra on heterogeneous systems

While many of the architectural details of future exascale-class high performance computer systems are still a matter of intense research, there appears to be a general consensus that they will be strongly heterogeneous, featuring "standard" as well as "accelerated" resources. Today, such resources are available as multicore processors, graphics processing units (GPUs), and other accelerators […]

CUDA

Jul, 29

Performance Analysis of a Particle-in-Cell Plasma Physics Code on Homogeneous and Heterogeneous HPC Systems

PIC methods are one of the most used methods in plasma simulations. We present a comprehensible evaluation of the PIC code performance on four current parallel platforms: IBM PowerPC, Intel Nehalem (SMP), Intel Sandy Bridge (SMP) and ARM GPU. The behavior of computational algorithms and data structures are analyzed to deduce which code optimizations will […]

OpenCL

high performance computing on graphics processing units: hgpu.org

Posts

2nd International Conference on Mechatronics and Robotics Engineering, 2016

4th International Conference on Electrical Energy and Networks (ICEEN), 2016

4th International Conference on System Modeling and Optimization (ICSMO), 2016

Parallel Surface Reconstruction on GPU

A University-Industry Collaboration Case Study: Intel Real-Time Multi-View Face Detection Capstone Design Projects

Investigating SRAM PUFs in large CPUs and GPUs

8th International Conference on Machine Learning and Computing (ICMLC), 2016

3rd International Conference on Advances in Electronics Engineering (ICAEE), 2016

4th International Conference on Information and Computer Networks (ICICN), 2016

International Conference on Communication and Information Processing (ICCIP), 2015

GHOST: Building blocks for high performance sparse linear algebra on heterogeneous systems

Performance Analysis of a Particle-in-Cell Plasma Physics Code on Homogeneous and Heterogeneous HPC Systems

Recent source codes

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

XaaS containers

SYCL Container

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

CFAL-bench

Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration

Can Large Language Models Predict Parallel Code Performance?

Most viewed papers (last 30 days)