A nascent approach to mine outliers using compression

Vashisht, Swati; Gupta, Shubhi; Mani, Atul

Volume 7, Issue 3, August 2014, Pages 1034–1037

@article{IJIAS-14-187-06,
author = {Swati Vashisht and Shubhi Gupta and Atul Mani},
title = {{A nascent approach to mine outliers using compression}},
journal = {International Journal of Innovation and Applied Studies},
volume = {7},
year = {2014},
pages = {1034--1037},
issue = {3},
number = {3},
issn = {2028-9324},
url = {http://www.ijias.issr-journals.org/abstract.php?article=IJIAS-14-187-06},
abstract_html_url = {http://www.ijias.issr-journals.org/abstract.php?article=IJIAS-14-187-06},
pdf_url = {http://www.issr-journals.org/links/papers.php?journal=ijias&application=pdf&article=IJIAS-14-187-06},
document_type={Article},
source={www.issr-journals.org}
}

Swati Vashisht¹, Shubhi Gupta², and Atul Mani³

¹ Computer Science, Amity Group of Institutions, U.P., India
² Computer Science, Amity Group of Institutions, U.P., India
³ Mechanical Engineering, RKGEC, U.P., India

Original language: English

Copyright © 2014 ISSR Journals. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Outlier mining is concerned with data objects that do not comply with the general behavior or model of the data, that is, objects that are either different from or inconsistent with the remaining set of data. Studying the extraordinary behavior of outliers helps uncover the knowledge hidden behind them and gives decision makers a way to increase profit or improve service quality. Hence, mining for outliers is an important data mining research area with numerous applications, including credit card fraud detection, detection of criminal activities in e-commerce, unusual usage of telecommunication services, and weather forecasting. Moreover, it is useful in digital and customized marketing for identifying the spending behavior of customers with extremely low or extremely high incomes, and in medical diagnosis for finding unusual responses to various medical treatments.
Some data mining techniques discard outliers as noise or exceptions. In some applications, however, these exceptions are more interesting than regularly occurring events, as in the case of terrorist attacks. Challenges in outlier detection include finding appropriate data models, the dependence of outlier detection systems on the application involved, finding techniques to distinguish outliers from errors or exceptions, and justifying why an object is identified as an outlier. Outliers can be detected with the N-gram technique, but this technique requires a large amount of storage space for metadata and the data dictionary. A number of compression models, e.g. the context tree weighting method, LZ77, LZ78, and LZW, are used to compress text and images. Burrows

Author Keywords: Outliers, Compression, N-gram technique, weighting methods, storage space.
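
The abstract does not spell out the authors' algorithm, so the following is only a minimal sketch of the general idea of compression-based outlier scoring, not the method of the paper. It assumes Python's standard `zlib` module (DEFLATE, an LZ77-based scheme) as a stand-in compressor, and the function names `compressed_size` and `outlier_scores` as well as the sample records are hypothetical. A record that shares little structure with the rest of the data needs more extra bytes when compressed together with it, which is used here as the outlier score.

```python
import zlib


def compressed_size(data: bytes) -> int:
    """Size in bytes of `data` after DEFLATE (LZ77 + Huffman) compression."""
    return len(zlib.compress(data, 9))


def outlier_scores(records: list[str]) -> list[tuple[int, str]]:
    """Score each record by the extra compressed bytes it adds to all the others.

    Returns (score, record) pairs sorted from most to least unusual.
    """
    encoded = [r.encode("utf-8") for r in records]
    scored = []
    for i, rec in enumerate(encoded):
        # Reference data: every record except the one being scored.
        reference = b"\n".join(encoded[:i] + encoded[i + 1:])
        extra = compressed_size(reference + b"\n" + rec) - compressed_size(reference)
        scored.append((extra, records[i]))
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


# Hypothetical data: 40 regular log-like records plus one irregular record.
records = [f"GET /page{i % 5}.html 200" for i in range(40)]
records.append("x9$Qz!7#kLp@2&Wm^4*Yb")

for score, record in outlier_scores(records)[:3]:
    print(score, record)
```

Because the score is a difference of compressed sizes, no N-gram dictionary has to be stored, at the cost of recompressing the reference data for every record. Longer records tend to score higher, so dividing by record length may be preferable when lengths vary widely.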

How to Cite this Article

Swati Vashisht, Shubhi Gupta, and Atul Mani, “A nascent approach to mine outliers using compression,” International Journal of Innovation and Applied Studies, vol. 7, no. 3, pp. 1034–1037, August 2014.
