Manuscript Number : IJSRSET1844348
Security Problem Detection of Hidden Data in Unstructured Log Messages With a Novel Text Mining Technique
Authors(2) :-Paridha Oza, Mr. Premkumar
Text mining is an area where ample possibilites of research is being opened because of large data being shared everyday with the use of applicaitions and social media, both online and offline. Security here is a concern while passing textual infromation in such ways. Further more the logs generated with these data has all the important information in it. In text mining the wide area of text analysis contains machine learning which is further inherited from artificial intelligence. In text mining applications, information from Document is present in the form of Text along with Side Information or Metadata. XML documents found as RSS feed and other links generated for web pages have title of the document, author name or date of Publication which are present in the text document. Such metadata may possess a lot of information for the clustering purposes. Here there are possibilities of delievering unclear data. Using metadata for analysing information without filtering it, can result to lower security as well as data leakage. To improvise the results, we have used an efficient Feature Selection method to perform the mining process to select the metadata or xml tags which is useful for Clustering so as to maximize the advantages from using it. In my research, I have added process to find words that are not commonly being processed. I am expecting to get better results than the earlier researches by modifying the process.
Paridha Oza
Security,Text Mining,Unstructured data,IARPA Dataset,clusters,Threat Detection
Publication Details
Published in :
Volume 4 | Issue 4 | March-April 2018 Article Preview
Computer Engineering Department, Silver Oak College of Engineering & Technology, Ahmedabad, Gujarat, India
Mr. Premkumar
Computer Engineering Department, Silver Oak College of Engineering & Technology, Ahmedabad, Gujarat, India
Date of Publication :
2018-04-30
License: This work is licensed under a Creative Commons Attribution 4.0 International License.
Page(s) :
1022-1025
Manuscript Number :
IJSRSET1844348
Publisher : Technoscience Academy
Journal URL :
https://ijsrset.com/IJSRSET1844348