CFPGrowthPlus
- class PAMI.multipleMinimumSupportBasedFrequentPattern.basic.CFPGrowthPlus.CFPGrowthPlus(iFile, MIS, sep='\t')[source]
Bases:
_frequentPatterns
- Description:
- Reference:
R. Uday Kiran P. Krishna Reddy Novel techniques to reduce search space in multiple minimum supports-based frequent pattern mining algorithms. 11-20 2011 EDBT https://doi.org/10.1145/1951365.1951370
- Parameters:
iFile – str : Name of the Input file to mine complete set of Uncertain Multiple Minimum Support Based Frequent patterns
oFile – str : Name of the output file to store complete set of Uncertain Minimum Support Based Frequent patterns
minSup – str: minimum support thresholds were tuned to find the appropriate ranges in the limited memory
sep – str : This variable is used to distinguish items from one another in a transaction. The default seperator is tab space. However, the users can override their default separator.
- Attributes:
- iFilefile
Input file name or path of the input file
- MIS: file or dictionary
Multiple minimum supports of all items in the database
- sepstr
This variable is used to distinguish items from one another in a transaction. The default separator is tab space or . However, the users can override their default separator.
- oFilefile
Name of the output file or the path of the output file
- startTime:float
To record the start time of the mining process
- endTime:float
To record the completion time of the mining process
- memoryUSSfloat
To store the total amount of USS memory consumed by the program
- memoryRSSfloat
To store the total amount of RSS memory consumed by the program
- Databaselist
To store the transactions of a database in list
- mapSupportDictionary
To maintain the information of item and their frequency
- lnoint
it represents the total no of transactions
- treeclass
it represents the Tree class
- finalPatternsdict
it represents to store the patterns
- Methods:
- mine()
Mining process will start from here
- getPatterns()
Complete set of patterns will be retrieved with this function
- savePatterns(oFile)
Complete set of frequent patterns will be loaded in to a output file
- getPatternsAsDataFrame()
Complete set of frequent patterns will be loaded in to a dataframe
- getMemoryUSS()
Total amount of USS memory consumed by the mining process will be retrieved from this function
- getMemoryRSS()
Total amount of RSS memory consumed by the mining process will be retrieved from this function
- getRuntime()
Total amount of runtime taken by the mining process will be retrieved from this function
- creatingItemSets()
Scans the dataset or dataframes and stores in list format
- frequentOneItem()
Extracts the one-frequent patterns from transactions
Executing the code on terminal:
Format: (.venv) $ python3 CFPGrowthPlus.py <inputFile> <outputFile> Examples: (.venv) $ python3 CFPGrowthPlus.py sampleDB.txt patterns.txt MISFile.txt .. note:: minSup will be considered in support count or frequency
Sample run of the importing code:
from PAMI.multipleMinimumSupportBasedFrequentPattern.basic import CFPGrowthPlus as alg obj = alg.CFPGrowthPlus(iFile, mIS) obj.startMine() frequentPatterns = obj.getPatterns() print("Total number of Frequent Patterns:", len(frequentPatterns)) obj.savePatterns(oFile) Df = obj.getPatternInDataFrame() memUSS = obj.getMemoryUSS() print("Total Memory in USS:", memUSS) memRSS = obj.getMemoryRSS() print("Total Memory in RSS", memRSS) run = obj.getRuntime() print("Total ExecutionTime in seconds:", run)
Credits:
The complete program was written by P.Likhitha under the supervision of Professor Rage Uday Kiran.
- getMemoryRSS()[source]
Total amount of RSS memory consumed by the mining process will be retrieved from this function
- Returns:
returning RSS memory consumed by the mining process
- Return type:
float
- getMemoryUSS()[source]
Total amount of USS memory consumed by the mining process will be retrieved from this function
- Returns:
returning USS memory consumed by the mining process
- Return type:
float
- getPatterns()[source]
Function to send the set of frequent patterns after completion of the mining process
- Returns:
returning frequent patterns
- Return type:
dict
- getPatternsAsDataFrame()[source]
Storing final frequent patterns in a dataframe
- Returns:
returning frequent patterns in a dataframe
- Return type:
pd.DataFrame
- getRuntime()[source]
Calculating the total amount of runtime taken by the mining process
- Returns:
returning total amount of runtime taken by the mining process
- Return type:
float