casasoft
October 7th, 2010, 07:18 PM
We need to implement a learning system which will learn to classify messages into various categories.
We will have various categories in the system and then initially we will manually teach the system how to classify the input text messages.
Initially, during training we would:
1. Input a text message to the system
2. Manually identify the most appropriate categories within which such a message is to fall in an order, for example mostly in Category B, then in Category A and then in Category C.
Training will continue until system learns patterns from the message to classify it into different categories.
The scope of the system is that later on, we would be able to give it an input message and it will automatically tell us which categories fit most for this particular message in a weighted order. So we would input a text message to the system and the system will output the most categories within which the messages goes to, for example, mostly Category D, then Category F and then Category G etc...
We would also like to be able to correct the system manually, so if for a particular message, the system ranks best Category D, followed by Category F and then by Category G, we might want to manually correct it to say that the best is Category F, then Category G and then Category D.
If you need any other clarifications of our requirements, let me know! :)
What is the best algorithm to implement such a learning system? Open for ideas!
Thanks in advance!
We will have various categories in the system and then initially we will manually teach the system how to classify the input text messages.
Initially, during training we would:
1. Input a text message to the system
2. Manually identify the most appropriate categories within which such a message is to fall in an order, for example mostly in Category B, then in Category A and then in Category C.
Training will continue until system learns patterns from the message to classify it into different categories.
The scope of the system is that later on, we would be able to give it an input message and it will automatically tell us which categories fit most for this particular message in a weighted order. So we would input a text message to the system and the system will output the most categories within which the messages goes to, for example, mostly Category D, then Category F and then Category G etc...
We would also like to be able to correct the system manually, so if for a particular message, the system ranks best Category D, followed by Category F and then by Category G, we might want to manually correct it to say that the best is Category F, then Category G and then Category D.
If you need any other clarifications of our requirements, let me know! :)
What is the best algorithm to implement such a learning system? Open for ideas!
Thanks in advance!