|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object mulan.data.Statistics
public class Statistics
Class for calculating statistics of a multi-label dataset. For more information, see
Tsoumakas, Grigorios, Katakis, Ioannis, Vlahavas, Ioannis: Mining Multi-Label Data. In Maimon, Oded and Rokach, Lior, editors, Data Mining and Knowledge Discovery Handbook, 667-685, 2010.
@incollection{Tsoumakas2010, author = {Tsoumakas, Grigorios and Katakis, Ioannis and Vlahavas, Ioannis}, booktitle = {Data Mining and Knowledge Discovery Handbook}, edition = {2nd}, editor = {Maimon, Oded and Rokach, Lior}, pages = {667-685}, publisher = {Springer}, title = {Mining Multi-Label Data}, year = {2010} }
Constructor Summary | |
---|---|
Statistics()
|
Method Summary | |
---|---|
double[][] |
calculateCoocurrence(MultiLabelInstances mdata)
This method calculates and prints a matrix with the coocurrences of pairs of labels |
double[][] |
calculatePhi(MultiLabelInstances dataSet)
Calculates phi correlation |
void |
calculateStats(MultiLabelInstances mlData)
calculates various multilabel statistics, such as label cardinality, label density and the set of distinct labels along with their frequency |
double |
cardinality()
returns the label cardinality of the dataset |
double |
density()
returns the label density of the dataset |
double[] |
getPhiHistogram()
Calculates a histogram of phi correlations |
TechnicalInformation |
getTechnicalInformation()
Returns an instance of a TechnicalInformation object, containing detailed information about the technical background of this class, e.g., paper reference or book this class is based on. |
String |
globalInfo()
Returns a string describing this class. |
HashMap<LabelSet,Integer> |
labelCombCount()
returns the HashMap containing the distinct labelsets and their frequencies |
int |
labelFrequency(LabelSet x)
returns the frequency of a labelset in the dataset |
Set<LabelSet> |
labelSets()
returns a set with the distinct labelsets of the dataset |
void |
printPhiCorrelations()
Prints out phi correlations |
void |
printPhiDiagram(double step)
This method prints data, useful for the visualization of Phi per dataset. |
double[] |
priors()
returns the prior probabilities of the labels |
int[] |
topPhiCorrelatedLabels(int labelIndex,
int k)
Returns the indices of the labels that have the strongest phi correlation with the label which is given as a parameter. |
String |
toString()
returns various multilabel statistics in textual representation |
int[] |
uncorrelatedLabels(int labelIndex,
double bound)
returns the indices of the labels whose phi coefficient values lie between -bound <= phi <= bound |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
Constructor Detail |
---|
public Statistics()
Method Detail |
---|
public HashMap<LabelSet,Integer> labelCombCount()
public double[][] calculateCoocurrence(MultiLabelInstances mdata)
mdata
- a multi-label data set
public void calculateStats(MultiLabelInstances mlData)
mlData
- a multi-label datasetpublic double[][] calculatePhi(MultiLabelInstances dataSet) throws Exception
dataSet
- a multi-label dataset
Exception
public void printPhiCorrelations()
public double[] getPhiHistogram()
public int[] uncorrelatedLabels(int labelIndex, double bound)
labelIndex
- bound
-
public int[] topPhiCorrelatedLabels(int labelIndex, int k)
labelIndex
- k
-
public void printPhiDiagram(double step)
step
- the phi value increment steppublic String toString()
toString
in class Object
public double[] priors()
public double cardinality()
public double density()
public Set<LabelSet> labelSets()
public int labelFrequency(LabelSet x)
x
- a labelset
public TechnicalInformation getTechnicalInformation()
getTechnicalInformation
in interface TechnicalInformationHandler
public String globalInfo()
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |