SequenceDataset (kelp-full 2.2.2 API)

java.lang.Object
- it.uniroma2.sag.kelp.data.dataset.SimpleDataset
- - it.uniroma2.sag.kelp.data.dataset.SequenceDataset

All Implemented Interfaces:

Dataset
```
public class SequenceDataset
extends SimpleDataset
```
A dataset made up of SequenceExample instances.

Author:

Danilo Croce

Constructor Summary

Constructors
Constructor and Description

SequenceDataset()

Method Summary

Methods
Modifier and Type	Method and Description
`List<Label>`	`getClassificationLabels()` Returns all the classification labels in the dataset.
`List<SequenceExample>`	`getSequenceExamples()`
`void`	`populate(String inputFilePath)` Populate the dataset by reading it from a KeLP compliant file.
`SequenceDataset[]`	`split(float percentage)` Returns two datasets created by splitting this dataset according to `percentage`.
`SequenceDataset[]`	`splitClassDistributionInvariant(float percentage)` Returns two datasets created by splitting this dataset according to `percentage`, preserving the original class distribution.

Methods inherited from class it.uniroma2.sag.kelp.data.dataset.SimpleDataset
addExample, addExamples, extractExamplesOfClasses, getExample, getExamples, getNextExample, getNextExamples, getNumberOfExamples, getNumberOfNegativeExamples, getNumberOfPositiveExamples, getRandExample, getRandExamples, getRegressionProperties, getShuffledDataset, getZeroVector, hasNextExample, isConsistent, manipulate, nFolding, nFoldingClassDistributionInvariant, populate, reset, save, setSeed, shuffleExamples

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

- Constructor Detail
  - SequenceDataset
```
public SequenceDataset()
```
- Method Detail
  - getClassificationLabels
```
public List<Label> getClassificationLabels()
```
    Description copied from interface: Dataset
    
    Returns all the classification labels in the dataset.
    
    Specified by:
    
    getClassificationLabels in interface Dataset
    
    Overrides:
    
    getClassificationLabels in class SimpleDataset
    
    Returns:
    the classification labels in the dataset
  - getSequenceExamples
```
public List<SequenceExample> getSequenceExamples()
```
    Returns:
    the list of sequence examples in the dataset
  - populate
```
public void populate(String inputFilePath)
              throws IOException,
                     InstantiationException,
                     ParsingExampleException
```
    Description copied from class: SimpleDataset
    
    Populate the dataset by reading it from a KeLP compliant file.
    
    Overrides:
    
    populate in class SimpleDataset
    
    Parameters:
    inputFilePath - the path of the file to be read
    
    Throws:
    
    IOException
    
    InstantiationException
    
    ParsingExampleException
  - split
```
public SequenceDataset[] split(float percentage)
```
    Description copied from class: SimpleDataset
    
    Returns two datasets created by splitting this dataset according to percentage. The examples are split in their stored order, without preserving the original distribution of the data among the classes. Thus the first dataset consists of the first fraction of the examples given by percentage, while the second dataset consists of all the remaining examples.
    
    Overrides:
    
    split in class SimpleDataset
    
    Parameters:
    percentage - should be a number in [0,1]
    
    Returns:
    two datasets generated by splitting this one
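
    The order-based behaviour described above can be illustrated with a small, library-free sketch. It is not KeLP's implementation: the class name `OrderedSplitSketch`, the use of plain lists instead of `SequenceDataset`, and the truncation applied when the cut point is not a whole number are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Illustrative sketch only; this is NOT the KeLP implementation.
class OrderedSplitSketch {

    // Splits items by position: the first (int)(percentage * n) items go to
    // the first part and all remaining items go to the second part.
    // Truncating the cut point is an assumption, not documented KeLP behaviour.
    static <T> List<List<T>> split(List<T> items, float percentage) {
        if (percentage < 0f || percentage > 1f) {
            throw new IllegalArgumentException("percentage must be in [0,1]");
        }
        int cut = (int) (percentage * items.size());
        List<List<T>> parts = new ArrayList<>();
        parts.add(new ArrayList<>(items.subList(0, cut)));
        parts.add(new ArrayList<>(items.subList(cut, items.size())));
        return parts;
    }

    public static void main(String[] args) {
        // Hypothetical stand-ins for sequence examples.
        List<String> sequences = Arrays.asList("s1", "s2", "s3", "s4");
        List<List<String>> parts = split(sequences, 0.75f);
        System.out.println("first:  " + parts.get(0));
        System.out.println("second: " + parts.get(1));
    }
}
```

    Because the cut depends only on position, a dataset whose examples are sorted by class can end up with a class missing from one of the two parts. Shuffling beforehand, for instance with the inherited `shuffleExamples`, is one way to reduce that risk.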
  - splitClassDistributionInvariant
```
public SequenceDataset[] splitClassDistributionInvariant(float percentage)
```
    Description copied from class: SimpleDataset
    
    Returns two datasets created by splitting this dataset according to percentage. The original distribution of the examples among the classes is maintained in the two datasets. Within each class, the examples are split in their stored order. Thus the first dataset consists of the first fraction, given by percentage, of the examples of each class, while the second dataset consists of all the remaining examples.
    
    Overrides:
    
    splitClassDistributionInvariant in class SimpleDataset
    
    Parameters:
    percentage - should be a number in [0,1]
    
    Returns:
    two datasets generated by splitting this one
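
    The per-class behaviour described above can be sketched without the library as follows. This is an illustration under stated assumptions rather than KeLP's code: `StratifiedSplitSketch`, the `labelOf` extractor, and the truncation of each per-class cut point are hypothetical, and how KeLP assigns a class to a sequence example is not covered here.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

// Illustrative sketch only; this is NOT the KeLP implementation.
class StratifiedSplitSketch {

    // Groups items by label (keeping first-seen label order and the original
    // order inside each group), then sends the first (int)(percentage * size)
    // items of every group to the first part and the rest to the second part.
    // Truncating each per-class cut point is an assumption.
    static <T, L> List<List<T>> split(List<T> items, Function<T, L> labelOf, float percentage) {
        if (percentage < 0f || percentage > 1f) {
            throw new IllegalArgumentException("percentage must be in [0,1]");
        }
        Map<L, List<T>> byClass = new LinkedHashMap<>();
        for (T item : items) {
            byClass.computeIfAbsent(labelOf.apply(item), k -> new ArrayList<>()).add(item);
        }
        List<T> first = new ArrayList<>();
        List<T> second = new ArrayList<>();
        for (List<T> group : byClass.values()) {
            int cut = (int) (percentage * group.size());
            first.addAll(group.subList(0, cut));
            second.addAll(group.subList(cut, group.size()));
        }
        List<List<T>> parts = new ArrayList<>();
        parts.add(first);
        parts.add(second);
        return parts;
    }

    public static void main(String[] args) {
        // Hypothetical examples whose first character encodes the class.
        List<String> examples = Arrays.asList("a1", "b1", "a2", "a3", "b2", "a4");
        List<List<String>> parts = split(examples, s -> s.substring(0, 1), 0.5f);
        System.out.println("first:  " + parts.get(0));
        System.out.println("second: " + parts.get(1));
    }
}
```

    In this sketch both parts keep the 2:1 ratio between classes `a` and `b` found in the input, which is the property the method name refers to. The output lists are grouped by class, so the original interleaving of examples is not preserved.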
