Title
Approximate clustering of fingerprint vectors with missing values
Abstract
We study the problem of clustering fingerprints with at most p missing values (CMV (p) for short) naturally arising in oligonucleotide fingerprinting, which is an efficient method for characterizing DNA clone libraries.We show that already CMV(2) is NP-hard. We also show that a greedy algorithm yields a min(1 + ln n, 2+pln l) approximation for CMV(p), and can be implemented to run in O(nl2p) time. Furthermore, we introduce other variants of the problem of clustering fingerprints with at most p missing values based on slightly different optimization criteria and show that they can be approximated in polynomial time with ratios 22p-1 and 2(1 - [EQUATION]), respectively.
Year
Venue
Keywords
2005
symposium on the theory of computing
pln l,different optimization criterion,greedy algorithm yield,oligonu- cleotide ngerprin ting,clustering fingerprint,efficient method,dna clone library,ln n,fingerprint vector,np-hardness,p missing value,oligonucleotide fingerprinting,approximate clustering,polynomial time,approximation algorithms,clustering,greedy algorithm,missing values
Field
DocType
Volume
Approximation algorithm,Discrete mathematics,Combinatorics,Greedy algorithm,Fingerprint,Missing data,Time complexity,Cluster analysis,Mathematics
Journal
abs/cs/0511082
ISBN
Citations 
PageRank 
1-920682-23-6
4
0.58
References 
Authors
1
6
Name
Order
Citations
PageRank
Andres Figueroa1424.27
Avraham Goldstein252.29
Jiang, Tao333914.58
Maciej Kurowski4786.25
A. Lingas525045.63
Mia Persson6307.17