The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Mining Sequential Association rules / Sequential Pattern Mining
SunnyLotusFlowe
Member Posts: 37 Contributor II
in Help
Hello all again,
I am stitting on a task right now and have a problem. I need to do Sequential Pattern Mining and wanted to know what the most used Operator for this task are.
I would be glad if someone could give me a tip .
greetings
SunnyLotusFlower
I am stitting on a task right now and have a problem. I need to do Sequential Pattern Mining and wanted to know what the most used Operator for this task are.
I would be glad if someone could give me a tip .
greetings
SunnyLotusFlower
0
Answers
greetings
SunnyLotusFlower
Another way would be to use recurrent neural networks with a delay, this last option is not possible in rapid miner.
greetings
SunnyLotusFlower
Can you give me a sequential pattern to analyze?
Customer-ID, Product, Date
10150, softdrink, 1.5.2010
10150, fruitveg, 1.5.2010
10236, frozenmeal, 1.5.2010
10236, beer, 15.5.2010
10360, fish, 21.6.2010
10360, cannedveg, 21.6.2010
10360, beer, 26.6.2010
And i need Association Rules like
"If Customer A buys fish and cannedveg on 21.6.2010 , then he will buy beer on 26.6.2010.
Fish and cannedveg on 21.6 => beer on 26.6
if u have missunderstood me , i must appologize for that.
greetings
Lotus
"If persons buys some things at sometime, how will this effect his buying in the future?"
A more specific hypothesis:
"If persons buys some things at sometime, how will this effect his buying the next time he enters the shop?"
This problem is a little less hard, but more manageable then the first.
To solve this problem I would convert the data.
edit: This might be possible in rapid miner using the windowing operator, but it is tricky
ID, softdrink, fruitveg, forzenmeal, fish, cannedveg, beer, softdrink2, fruitveg2, forzenmeal2, fish2, cannedveg2, beer2
10150, 1, 1, 0, 0, 0, 0, ?, ?, ?, ?, ?, ? (this guy buy softdrink and fruitveg, no info for next time)
10236, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 1 (this guy buy frozen meal, next time beer)
10236, 0, 0, 0, 0, 0, 1, ?, ?, ?, ?, ?, ? (same as last entry, but no info on next next time)
10360, 0, 0, 0, 1, 1, 0, 0, 0, 0, 0, 0, 1 (this guy buy fish and canned veg, and next time buy beer)
10360, 0, 0, 0, 0, 0, 1, ?, ?, ?, ?, ?, ? (same as last entry, but again no next next info)
(You did not give me much data, so you get a lot of ? symbols)
You can run any unsupervised learning algorithm on this data.
If you want to solve the "If persons buys some things at sometime, how will this effect his buying in the future?" problem,
you will get many more attributes in your dataset, it is possible, but unlikely to yield good results.
edit:
you might want to also add the attribute "number of days since last visit"
to account for the fact that shop visits do not occur at equal intervals.
"If persons buys some things at sometime, how will this effect his buying the next time he enters the shop?"
Do u do this conversion with the Windowing operator?
If i understand it correkt: for every new date u get a new basket. Is that correct?
and then i should perform a FP-Growth on that data?
Could u get me a Workflow for this ? i never used the windowing-operator...
greetings Lotus
Maybe this code can help, from com.rapidminer.gui.templates.Template@320a80db (market basket)
but the problem is i dont got much time to work on this further.
Maybe i find some time later for do this ....
thanks alot
greetings User
Ramakrishnan Srikant, Rakesh Agrawal (1996). Mining Sequential Patterns: Generalizations and Performance Improvements.
What should be the input to rapidminer?
Like figure 1, or like figure 2, or other?
http://img441.imageshack.us/img441/9206/inputx.jpg
greetings
Lotus
______________________
@ B_Miner
the problem is i should use additional algorithms from weka. I only can use the algorithms from RapidMiner (this comes from the task).
and now it looks like rapidminer cant do a sequential pattern analysis....
but thx for the tip