Data Dashboard- Integrating Data Mining with Data Deduplication

Vitasta Abrol; Jyoti Malhotra

Call for Paper

June Edition

IJCA solicits high quality original research papers for the upcoming June edition of the journal. The last date of research paper submission is 20 May 2024

Submit your paper

Know more

The week's pick

Enhancing Privacy Preservation: Multi-Attribute Protection with P-Sensitive K-Anonymity

Twinkle Patel Kiran Amin

Random Articles

Interactive Image Segmentation using Color and Texture Features

February

2016

Image Registration using combination of PCA and GPOF Method for Multiframe Super-Resolution

June

2015

Determine Weakest Bus for IEEE 14 Bus Systems

Nov

2018

The SOM Robustness Capacity for Phonemes Recognition in Adverse Environment

December

2012

Reseach Article

Data Dashboard- Integrating Data Mining with Data Deduplication

by Vitasta Abrol, Jyoti Malhotra

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 71 - Number 22

Year of Publication: 2013

Authors: Vitasta Abrol, Jyoti Malhotra

10.5120/12620-9332

Vitasta Abrol, Jyoti Malhotra . Data Dashboard- Integrating Data Mining with Data Deduplication. International Journal of Computer Applications. 71, 22 ( June 2013), 28-33. DOI=10.5120/12620-9332

@article{ 10.5120/12620-9332,

author = { Vitasta Abrol, Jyoti Malhotra },

title = { Data Dashboard- Integrating Data Mining with Data Deduplication },

journal = { International Journal of Computer Applications },

issue_date = { June 2013 },

volume = { 71 },

number = { 22 },

month = { June },

year = { 2013 },

issn = { 0975-8887 },

pages = { 28-33 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume71/number22/12620-9332/ },

doi = { 10.5120/12620-9332 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T21:36:23.145702+05:30

%A Vitasta Abrol

%A Jyoti Malhotra

%T Data Dashboard- Integrating Data Mining with Data Deduplication

%J International Journal of Computer Applications

%@ 0975-8887

%V 71

%N 22

%P 28-33

%D 2013

%I Foundation of Computer Science (FCS), NY, USA

Abstract

Many applications deal with huge amount of data and that scattered data needs to be transformed into something relevant and meaningful. To make sense of such data is the need of many applications and areas of technology. The data that is already present is very huge, noisy and has a complex structure. We are working on the idea of integrating data mining with data deduplication. Data Dashboard is a tool which can take complex data involving various dimensions and simultaneously uses data deduplication algorithms that help in removing redundancy in the data up to 95%. Thus it provides high reliability, low disk space and high throughput. The tool then uses this data mined information and deploys it on dynamic platform such as web which provides ease to user to access huge database.

References

Ming-Syan Chen, "Data Mining: An Overview from a Database Perspective", IEEE Transactions on Knowledge and Data Engineering, VOL 8, NO. 6.
Michael Steinbach, Levent Ertöz, and Vipin Kumar "The Challenges of Clustering High Dimensional Data".
Mamatha Nadikota, Satya P Kumar Somayajula,Dr. C. P. V. N. J. Mohan Rao ,CSE Department,Avanthi College of Engg &Tech ,Tamaram,Visakhapatnam,A,P. . ,India "Hashing and Pipelining Techniques for Association Rule Mining" International Journal of Computer Science and Information Technologies, Vol. 2 (4), 2011, 1448-1452.
Ying-Hsiang Wen, Jen-Wei Huang, and Ming-Syan Chen,"Hardware-Enhanced Association Rule Mining with Hashing and Pipelining" IEEE Transactions on knowledge and data engineering, VOL. 20, NO. 6, JUNE 2008.
Alissar NASSER, Denis HAMAD, Chaiban NASR, KAIST Ky Ho Park CORE Lab. KAIST "K-means Clustering Algorithm in Projected Spaces".
R. Agrawal and al, "Automatic Clustering of High Dimensional Data used for Data Mining Applications," Proc. ACM SIGMOD conf. , pp. 94- 105, 1998.
N. Mandagere, P. and S. Uttamchandani. "Demystifying Data Deduplication" presented In Companion '08: Proceedings of ACM/IFIP/USENIX Middleware '08, pages 12-17,2008

Index Terms

Computer Science

Information Sciences

Keywords

Data mining Data Deduplication Clustering Hashing API