high performance computing on graphics processing units: hgpu.org

hgpu.org » Programming » Algorithms » The Anatomy of High-Performance 2D Similarity Calculations

The Anatomy of High-Performance 2D Similarity Calculations

Imran S. Haque, Vijay S. Pande, W. Patrick Walters

Department of Computer Science, Stanford University, Stanford, California 94305, United States

Journal of Chemical Information and Modeling, 2011, 51 (9), pp 2345-2351

DOI:10.1021/ci200235e

@article{haque2011anatomy,

title={The Anatomy of High-Performance 2D Similarity Calculations},

author={Haque, I.S. and Pande, V.S. and Walters, W.P.},

journal={Journal of Chemical Information and Modeling},

year={2011},

publisher={ACS Publications}

}

Download (PDF)

View

Source

Source codes

Package:

Anatomy of High-Performance 2D Similarity Calculations

2231

views

Similarity measures based on the comparison of dense bit vectors of two-dimensional chemical features are a dominant method in chemical informatics. For large-scale problems, including compound selection and machine learning, computing the intersection between two dense bit vectors is the overwhelming bottleneck. We describe efficient implementations of this primitive as well as example applications using features of modern CPUs that allow 20-40x performance increases relative to typical code. Specifically, we describe fast methods for population count on modern x86 processors and cache-efficient matrix traversal and leader clustering algorithms that alleviate memory bandwidth bottlenecks in similarity matrix construction and clustering. The speed of our 2D comparison primitives is within a small factor of that obtained on GPUs and does not require specialized hardware.

Tags: Algorithms, Biochemistry, Chemistry, Clustering, CUDA, nVidia, nVidia GeForce GTX 260, nVidia GeForce GTX 480, Package

October 16, 2011 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

KernelGYM & Dr. Kernel: A distributed GPU environment and a collection of RL training methods to support RL for Kernel Generations

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

* * *

high performance computing on graphics processing units: hgpu.org

The Anatomy of High-Performance 2D Similarity Calculations

Package:

Your response

Recent source codes

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

CUDABench: Benchmarking LLMs for Text-to-CUDA Generation

CL4SE: A Context Learning Benchmark For Software Engineering Tasks

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5

DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels

KernelGYM & Dr. Kernel: A distributed GPU environment and a collection of RL training methods to support RL for Kernel Generations

Vortex-Optimized Light-weight Toolchain (VOLT)

SciDef: Automated Definition Extraction from Scientific Literature

bioagent-bench: Benchmark for evaluating LLM agents in bioinformatics

Most viewed papers (last 30 days)

The Anatomy of High-Performance 2D Similarity Calculations

Package:

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)