13964

Posts

May, 13

A Survey of CPU-GPU Heterogeneous Computing Techniques

As both CPU and GPU become employed in a wide range of applications, it has been acknowledged that both of these processing units (PUs) have their unique features and strengths and hence, CPU-GPU collaboration is inevitable to achieve high-performance computing. This has motivated significant amount of research on heterogeneous computing techniques, along with the design […]
May, 13

CUDA 7 Performance Overview webinar

CUDA 7 Toolkit has lots of new features – and also many performance enhancements. Ujval Kapasi is Director, CUDA Product Management at NVIDIA. Ujval received his Ph.D. in Electrical Engineering from Stanford University and his Bachelor of Science in Engineering from Brown University. Download slides (PDF) View slides (PDF) via Google Docs
May, 12

7th International Conference on Computer Technology and Development (ICCTD 2015), 2015

Submission Deadline: 2015-07-10 Topics: A1: Algorithms B1: Communication Networks A2: Bioinformatics B2: Wireless Communications A3: Computer Simulation B3: Mobile Communications A4: Control Systems B4: Infrastructure for Next Generation Networks A5: Data Mining B5: Information & Communication A6: Expert Systems B6: Coding Theory A7: Image Processing B7: Optical Communications A8: Multimedia B8: Internet Technologies A9: Natural […]
May, 12

4th International Conference on Communication and Broadband Networking (ICCBN 2015), 2015

Submission Deadline: 2015-07-10 Topics: • Wireless Communications and Networking • Multimedia Networking • Signal Processing for Communications • Networking Algorithms and Performance Evaluation • Wireless Sensor Networks • Communication and Information Theory • Network Security • Cognitive Radio Networks • Internet Applications • Protocols and Algorithms • Coding Theory • 3G & 4G Mobile Communication […]
May, 12

International Conference on Systems, Control and Communications (ICSCC), 2015

Submission Deadline: 2015-07-10 Topics: Information-based control systems Distributed and cooperative control systems Networked control systems (NCS) Wired and wireless networks Network control (admission/flow/congestion control, etc.) Network scheduling and bandwidth allocation Informatics in control and communication Cyber-physical systems (CPSs) Sensor and actuator networks Multi-agent systems Case studies and applications For more topics: http://www.icscc.org/cfp.html Publication: All accepted […]
May, 12

Improving CUDA DNA Analysis Software with Genetic Programming

We genetically improve BarraCUDA using a BNF grammar incorporating C scoping rules with GP. Barracuda maps next generation DNA sequences to the human genome using the Burrows-Wheeler algorithm (BWA) on nVidia Tesla parallel graphics hardware (GPUs). GI using phenotypic tabu search with manually grown code can graft new features giving more than 100 fold speed […]
May, 12

CVPI: A Computer Vision Library For Mobile and Embedded Platforms

CVPI is a library for implementing computer vision programs on computers supporting OpenVG. It adds additional image processing capabilities to OpenVG that are necessary for computer vision, as well a as providing an interface to setup the rendering environment. OpenVG is a hardware accelerated C API for vector and raster 2D graphics. It is widely […]
May, 12

Development of Parallel Architectures for Radar/Video Signal Processing Applications

The applications of digital signal processing continue to expand and use in many different areas such as signal processing, radar tracking, image processing, medical imaging, video broadcasting, and control algorithms for sensor array processing. Most of the signal processing applications are intensive and may not achieve the real time requirements. However, the Multi-core phenomenon has […]
May, 12

Density Estimations for Approximate Query Processing on SIMD Architectures

Approximate query processing (AQP) is an interesting alternative for exact query processing. It is a tool for dealing with the huge data volumes where response time is more important than perfect accuracy (this is typically the case during initial phase of data exploration). There are many techniques for AQP, one of them is based on […]
May, 12

FPGA-Based Design of Numerical Algorithms for Kernel Density Estimation Using High Level Synthesis Approach

FPGA technology can offer significantly higher performance at much lower power than is available from CPUs and GPUs in many computational problems. Unfortunately, programming for FPGA (using hardware description languages, HDL) is a difficult and not-trivial task and is not intuitive for C/C++/Java programmers. To bring the gap between programming effectiveness and difficulty the High […]
May, 10

Age and Gender Classification using Convolutional Neural Networks

Automatic age and gender classification has become relevant to an increasing amount of applications, particularly since the rise of social platforms and social media. Nevertheless, performance of existing methods on real-world images is still significantly lacking, especially when compared to the tremendous leaps in performance recently reported for the related task of face recognition. In […]
May, 10

Numerical Simulation of Melting with Natural Convection Based on Lattice Boltzmann Method and Performed with CUDA Enabled GPU

A new solver is developed to numerically simulate the melting phase change with natural convection. This solver was implemented on a single Nvidia GPU based on the CUDA technology in order to simulate the melting phase change in a 2D rectangular enclosure. The Rayleigh number is of the order of magnitude of 108 and Prandlt […]
Page 5 of 805« First...34567...102030...Last »

* * *

* * *

Like us on Facebook

HGPU group

243 people like HGPU on Facebook

Follow us on Twitter

HGPU group

1474 peoples are following HGPU @twitter

* * *

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL application at hgpu.org. We provide 1 minute of computer time per each run on two nodes with two AMD and one nVidia graphics processing units, correspondingly. There are no restrictions on the number of starts.

The platforms are

Node 1
  • GPU device 0: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 @ 2.8GHz 1055T
  • RAM: 12GB
  • OS: OpenSUSE 13.1
  • SDK: nVidia CUDA Toolkit 6.5.14, AMD APP SDK 3.0
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.3
  • SDK: AMD APP SDK 3.0

Completed OpenCL project should be uploaded via User dashboard (see instructions and example there), compilation and execution terminal output logs will be provided to the user.

The information send to hgpu.org will be treated according to our Privacy Policy

HGPU group © 2010-2015 hgpu.org

All rights belong to the respective authors

Contact us: