• Thumbnail for Density-based clustering validation
    Density-Based Clustering Validation (DBCV) is a metric designed to assess the quality of clustering solutions, particularly for density-based clustering...
    8 KB (908 words) - 17:32, 22 June 2025
  • Thumbnail for Cluster analysis
    the kernel density estimate, which results in over-fragmentation of cluster tails. Density-based clustering examples Density-based clustering with DBSCAN...
    75 KB (9,513 words) - 02:05, 30 April 2025
  • have a low or negative value, then the clustering configuration may have too many or too few clusters. A clustering with an average silhouette width of over...
    14 KB (2,220 words) - 20:29, 20 June 2025
  • Automatic clustering algorithms are algorithms that can perform clustering without prior knowledge of data sets. In contrast with other cluster analysis...
    13 KB (1,625 words) - 12:02, 20 May 2025
  • Thumbnail for Kernel density estimation
    a non-parametric method to estimate the probability density function of a random variable based on kernels as weights. KDE answers a fundamental data...
    39 KB (4,618 words) - 09:26, 6 May 2025
  • k-means clustering is a method of vector quantization, originally from signal processing, that aims to partition n observations into k clusters in which...
    62 KB (7,754 words) - 11:44, 13 March 2025
  • Thumbnail for Density estimation
    population. A variety of approaches to density estimation are used, including Parzen windows and a range of data clustering techniques, including vector quantization...
    10 KB (1,305 words) - 19:10, 1 May 2025
  • be validated before real use with an unseen data (validation set). "The literature on machine learning often reverses the meaning of 'validation' and...
    20 KB (2,212 words) - 08:39, 27 May 2025
  • Thumbnail for ELKI
    Hierarchical clustering (including the fast SLINK, CLINK, NNChain and Anderberg algorithms) Single-linkage clustering Leader clustering DBSCAN (Density-Based Spatial...
    19 KB (2,106 words) - 07:06, 8 January 2025
  • Thumbnail for Learning curve (machine learning)
    Model-Based Clustering". Journal of Machine Learning Research. 2 (3): 397. Archived from the original on 2013-07-15. scikit-learn developers. "Validation curves:...
    6 KB (749 words) - 04:41, 26 May 2025
  • Thumbnail for Microarray analysis techniques
    analysis. Hierarchical clustering is a statistical method for finding relatively homogeneous clusters. Hierarchical clustering consists of two separate...
    31 KB (3,567 words) - 04:54, 11 June 2025
  • feature engineering has been clustering of feature-objects or sample-objects in a dataset. Especially, feature engineering based on matrix decomposition has...
    20 KB (2,183 words) - 06:29, 26 May 2025
  • Thumbnail for Time series
    series data may be clustered, however special care has to be taken when considering subsequence clustering. Time series clustering may be split into whole...
    43 KB (5,025 words) - 15:47, 14 March 2025
  • Bootstrapping Cross validation Jackknife Permutation tests rely on resampling the original data assuming the null hypothesis. Based on the resampled data...
    18 KB (2,236 words) - 09:36, 16 March 2025
  • Thumbnail for Cross-validation (statistics)
    Cross-validation, sometimes called rotation estimation or out-of-sample testing, is any of various similar model validation techniques for assessing how...
    44 KB (5,781 words) - 09:14, 19 February 2025
  • Thumbnail for Friedmann equations
    geometry of the universe as a function of the fluid density. Relativisitic cosmology models based on the FLRW metric and obeying the Friedmann equations...
    30 KB (4,441 words) - 17:48, 20 June 2025
  • Hierarchical clustering Single-linkage clustering Conceptual clustering Cluster analysis BIRCH DBSCAN Expectation–maximization (EM) Fuzzy clustering Hierarchical...
    39 KB (3,386 words) - 19:51, 2 June 2025
  • Thumbnail for Regression analysis
    correlation coefficient Quasi-variance Prediction interval Regression validation Robust regression Segmented regression Signal processing Stepwise regression...
    37 KB (5,235 words) - 03:23, 20 June 2025
  • Thumbnail for Overfitting
    overfitting, several techniques are available (e.g., model comparison, cross-validation, regularization, early stopping, pruning, Bayesian priors, or dropout)...
    25 KB (2,843 words) - 18:52, 18 April 2025
  • Thumbnail for T-distributed stochastic neighbor embedding
     188–203. doi:10.1007/978-3-319-68474-1_13. "K-means clustering on the output of t-SNE". Cross Validated. Retrieved 2018-04-16. Wattenberg, Martin; Viégas...
    15 KB (2,065 words) - 01:25, 24 May 2025
  • Thumbnail for Biological network inference
    Cluster analysis algorithms come in many forms as well such as Hierarchical clustering, k-means clustering, Distribution-based clustering, Density-based...
    33 KB (3,831 words) - 22:35, 29 June 2024
  • Thumbnail for Structural bioinformatics
    can be used for clustering protein signatures, detecting protein-ligand interactions, predicting ΔΔG, and proposing mutations based on Euclidean distance...
    35 KB (3,532 words) - 23:28, 22 May 2024
  • Cross-validation/Train/Test split (must fit MinMax/ngrams/etc on only the train split, then transform the test set) Duplicate rows between train/validation/test...
    9 KB (1,027 words) - 22:44, 12 May 2025
  • cross-validation to select the best model from a bucket of models. Likewise, the results from BMC may be approximated by using cross-validation to select...
    53 KB (6,691 words) - 14:14, 8 June 2025
  • text feature Task detection; e.g., binary classification, regression, clustering, or ranking Feature engineering Feature selection Feature extraction Meta-learning...
    9 KB (1,046 words) - 02:47, 26 May 2025
  • Thumbnail for Cluster sampling
    observations per cluster is fixed at n. Below, V c ( β ) {\displaystyle V_{c}(\beta )} stands for the covariance matrix adjusted for clustering, V ( β ) {\displaystyle...
    16 KB (2,332 words) - 04:09, 13 December 2024
  • Thumbnail for Median
    maximising the distance between cluster-means that is used in k-means clustering, is replaced by maximising the distance between cluster-medians. This is a method...
    63 KB (8,010 words) - 23:47, 14 June 2025
  • spectral density estimation (SDE) or simply spectral estimation is to estimate the spectral density (also known as the power spectral density) of a signal...
    24 KB (3,548 words) - 15:32, 18 June 2025
  • of unsupervised machine learning include clustering, dimensionality reduction, and density estimation. Cluster analysis is the assignment of a set of observations...
    140 KB (15,572 words) - 23:09, 20 June 2025
  • Sabine; Leese, Morven; and Stahl, Daniel (2011) "Miscellaneous Clustering Methods", in Cluster Analysis, 5th Edition, John Wiley & Sons, Ltd., Chichester...
    32 KB (4,333 words) - 23:48, 16 April 2025