Convex Clustering: An Attractive Alternative to Hierarchical Clustering

hgpu.org » Programming » Algorithms » Convex Clustering: An Attractive Alternative to Hierarchical Clustering

Convex Clustering: An Attractive Alternative to Hierarchical Clustering

Gary K. Chen, Eric Chi, John Ranola, Kenneth Lange

Department of Preventive Medicine (Biostatistics Division), University of Southern California, CA

arXiv:1409.2065 [q-bio.GN], (6 Sep 2014)

@article{2014arXiv1409.2065C,

author={Chen}, G.~K. and {Chi}, E. and {Ranola}, J. and {Lange}, K.},

title={"{Convex Clustering: An Attractive Alternative to Hierarchical Clustering}"},

journal={ArXiv e-prints},

archivePrefix={"arXiv"},

eprint={1409.2065},

primaryClass={"q-bio.GN"},

keywords={Quantitative Biology – Genomics},

year={2014},

month={sep},

adsurl={http://adsabs.harvard.edu/abs/2014arXiv1409.2065C},

adsnote={Provided by the SAO/NASA Astrophysics Data System}

}

Download (PDF)

View

Source

Source codes

Package:

CONVEXCLUSTER: MM algorithms using the proximal distance algorithm

3305

views

The primary goal in cluster analysis is to discover natural groupings of objects. The field of cluster analysis is crowded with diverse methods that make special assumptions about data and address different scientific aims. Despite its shortcomings in accuracy, hierarchical clustering is the dominant clustering method in bioinformatics. Biologists find the trees constructed by hierarchical clustering visually appealing and in tune with their evolutionary perspective. Hierarchical clustering operates on multiple scales simultaneously. This is essential, for instance, in transcriptome data where one may be interested in making qualitative inferences about how lower-order relationships like gene modules lead to higher-order relationships like pathways or biological processes. The recently developed method of convex clustering preserves the visual appeal of hierarchical clustering while ameliorating its propensity to make false inferences in the presence of outliers and noise. The current paper exploits the proximal distance principle to construct a novel algorithm for solving the convex clustering problem. The solution paths generated by convex clustering reveal relationships between clusters that are hidden by static methods such as k-means clustering. Our convex clustering software separates parameters, accommodates missing data, and supports prior information on relationships. The software is implemented on ATI and nVidia graphics processing units (GPUs) for maximal speed. Several biological examples illustrate the strengths of convex clustering and the ability of the proximal distance algorithm to handle high-dimensional problems.

Tags: Algorithms, Bioinformatics, Biology, Cluster analysis, Clustering, Genomics, Hierarchical clustering, nVidia, OpenCL, Package, Tesla C2050

September 9, 2014 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5

DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels

KernelGYM & Dr. Kernel: A distributed GPU environment and a collection of RL training methods to support RL for Kernel Generations

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

* * *

high performance computing on graphics processing units: hgpu.org