Increasing the stability through the preprocessing anomalous objects in a given data | Статья в журнале «Молодой ученый»

Отправьте статью сегодня! Журнал выйдет 30 ноября, печатный экземпляр отправим 4 декабря.

Опубликовать статью в журнале

Авторы: ,

Рубрика: Социология

Опубликовано в Молодой учёный №28 (132) декабрь 2016 г.

Дата публикации: 14.12.2016

Статья просмотрена: 27 раз

Библиографическое описание:

Матлатипов, Г. Р. Increasing the stability through the preprocessing anomalous objects in a given data / Г. Р. Матлатипов, Ж. М. Маттиев. — Текст : непосредственный // Молодой ученый. — 2016. — № 28 (132). — С. 790-794. — URL: https://moluch.ru/archive/132/36651/ (дата обращения: 16.11.2024).



Different types of features in the description of objects does not allow to use as a tool for the study of methods of statistical exploratory data analysis. To solve this problem it is offered to use the methods of data mining oriented on search of the hidden regularities in databases.

One of the directions of the intellectual analysis is classification. The considerable volume of information at the solution of problems of classification represents knowledge for structural placement of class objects and complexity of a configuration in borders of classes.

Data on structural placement of objects of classes in feature space for a given metric. we tried to get a variety of ways.. For example, about complexity of a configuration in borders of classes it was possible to judge by results of correct recognition of objects by means of linear, piecewise and linear decision functions [1]. Another feature was the use of structural stability of the objects in the disjoint classes. The problem of calculating the stability of a variety of structural measures are being considered within the framework of nonparametric methods of recognition.

Stability shows the local properties in the sample of classified objects. Knowledge of these properties is necessary to determine the anomalous object classes, explaining the reasons for choosing the objects of the minimum coverage standards of learning sample, sufficient for its correct recognition.

The variety value of stability of objects of classes in [4] depended on the choice of the metric. As in polytypic feature space there are no proximity measures with properties of a metrics, it was necessary to use different approaches. Thus, the structural characteristics of the placement of each of the ethalon objects locally and optimal coverage class training sample in artificial neural networks (ANN) with minimal configuration was calculated through a share incorrectly recognized objects during the exam on a set of a moving . The solution of a problem of an estimstion of stability and algorithmic (without the participation of experts) ranking objects of classes on generalized estimates in heterogeneous feature space had not previously considered.

Statement of the problem

We consider the problem of recognition in the standard formulation. It is believed that given a set of objects containing representatives l disjoint classes . Description of objects is performed using a set of n different types of features , of which are measured in nominal scale, on an interval scale.

It is required to compare the stability of objects in a given data and after the preprocessing.

For each construct a sequence objects E ordered with increasing distance from the metric and allocation of set of boundary pairs

, ,

formed from the inequalities

where () — the number of objects from nearest , belonging to the class , . Objects of class make a relative majority for any integer nearest objects to .

The value of functionality F(k) is determined by quantity of the executed inequalities by a set of boundary pairs of of each object , ().

Stability of object of on a metrics of is calculated as

and class

Computational experiment.

To illustrate the process visualization objects was used «Korean» [1] data (which is taken from sociology fields). The set is represented 100 objects with 24 nominal features. Objects are divided into two disjoint classes, K1 (Uzbek people), K2 (Korean people). Results of stability of the objects in a given data are presented in Table1.

Table 1

Stability of the objects in agiven data

Number of Object

Stability

54

1.00

19

1.00

1

1.00

30

0.57

74

0.53

100

0.44

95

0.00

87

0.00

83

0.00

According to Table1 average stability of the first class and second class are equal to 0.74 and 0.69 respectively. Anomalous objects are located in the bottom of the tablle and is choosen according to the low stability. Anomalous objects are presented in Table 2.

Table 2

List of Anomalous objects

Number of object

95

87

83

57

45

23

84

53

49

75

15

10

We perform preprocessing through the changing of the classes of anomalous objects. Result for stability of the objects after the preprocessing are presented in Table 3.

Table 3

Stability of the objects after the preprocessing

Number of Object

Stability

54

1.00

19

1.00

1

0.94

30

0.90

74

0.98

100

0.99

95

0.92

87

0.86

83

0.93

Conclusion.

As we can see in above tables, stabilities of features were better after the preprocessing. For instance the stabilities of 95th and 85th objects were 0.00 in Table1 and it changed to 0.92 and 0.93 respectively. Although the stability of first object decreased average stability of the first class and second class were equal to 0.87 and 0.92 respectively. It means anomalous objects are nearer to other class objects than their class.

References:

  1. Knowledge Discovering from Clinical Data Based on Classification Tasks Solving / N. A. Ignat'ev, F. T. Adilova, G. R. Matlatipov, P. P. Chernyш // MediNFO. — Amsterdam: IOS Press, 2001. — P. 1354–1358.
  2. Игнатьев Н. А. Выбор минимальной конфигурации нейронных сетей // Вычислительные технологии. – Новосибирск, 2001. – Т. 6, № 1. – С. 23-28.
  3. Игнатьев Н. А. Интеллектуальный анализ данных на базе непараметрических методов классификации и разделения выборок объектов поверхностями. – Ташкент, 2008. – 108 с.
  4. Игнатьев Н. А. Обобщенные оценки и локальные метрики объектов в интеллектуальном анализе данных // Монография. – Ташкент: Национальный университет Узбекистана им. МирзоУлугбека, 2014. — 71 с.
  5. Wold S. Pattern recognition by means of disjoint principal components models // Pattern Recognition, 8, № 3, 1976, 127–139.
Основные термины (генерируются автоматически): ANN, IOS, интеллектуальный анализ данных, Ташкент.


Ключевые слова

Устойчивость объекта, Аномальные объекты, Оценка сложности алгоритма, stability of object, anomalous objects, estimation of complexity of the algorithm

Похожие статьи

Logo detection in images with a complex background using the contour information of images

Text detection has gotten a great attention as highly active application-oriented research area in computer vision, artificial intelligence, and image processing. In this article, we implement the algorithm for text logo detection in images with a c...

The use of innovative methods in education

The article presents the types and benefits of interactive methods, as a form of innovative learning. Also, an algorithm for conducting an interactive lesson is presented, and also features of carrying out its main part are stated. The features of ca...

Features of developing mobile applications on the Thunkable platform

This article discusses the possibility of using a cloud environment for developing mobile applications, called Thunkable, in educational processes. The main features of working with the environment, its advantages and disadvantages are considered.

Classification and assessment of fixed assets

The article describes the accounting approach to the classification of long-term assets. It is shown that the classification of long-term assets in the accounting system is important for their evaluation. The current standards provide for three appro...

The modern lesson of a foreign language in the context of the implementation of FSES

The topic of this article is very relevant today since the transition to the new FSES has introduced some innovations into the structure of the modern lesson, where the main task is to activate the student’s cognitive abilities aimed at studying his ...

Protections of converting installations, made on the traditional element base

It is stated that the use of differential protection in converter installations allows, in comparison with maximum current protection, to increase speed and sensitivity. It is mentioned that over the past 20 years a number of new protection devices f...

Modeling the process of teaching a foreign language utterance using multimedia

The article reflects the main stages of modeling teaching utterance in multimedia context as a process of forming a speech action image. The properties of the information space are considered as the basis for image modeling, interactive components —...

Using English teaching applications in an EFL classroom for primary and secondary schoolchildren

This article discusses the idea of using English mobile and web applications in an EFL classroom to aid pupils to increase their productivity and learning process. Various applications to use in the learning process and at home as a self-instrument f...

Features of technical translation

In the presented scientific article the main aspects of technical translation are described. Particular attention is paid to its methods (automated and manual). Four main stages of technical translation are given, the translation of special terms is ...

Forming the phonetic competence in a foreign language at the secondary schools

This article considers about forming the phonetic competence in a foreign language at the secondary schools by the help of various types of exercises. Here are given the basic requirements for improving the pronunciation skills either.

Похожие статьи

Logo detection in images with a complex background using the contour information of images

Text detection has gotten a great attention as highly active application-oriented research area in computer vision, artificial intelligence, and image processing. In this article, we implement the algorithm for text logo detection in images with a c...

The use of innovative methods in education

The article presents the types and benefits of interactive methods, as a form of innovative learning. Also, an algorithm for conducting an interactive lesson is presented, and also features of carrying out its main part are stated. The features of ca...

Features of developing mobile applications on the Thunkable platform

This article discusses the possibility of using a cloud environment for developing mobile applications, called Thunkable, in educational processes. The main features of working with the environment, its advantages and disadvantages are considered.

Classification and assessment of fixed assets

The article describes the accounting approach to the classification of long-term assets. It is shown that the classification of long-term assets in the accounting system is important for their evaluation. The current standards provide for three appro...

The modern lesson of a foreign language in the context of the implementation of FSES

The topic of this article is very relevant today since the transition to the new FSES has introduced some innovations into the structure of the modern lesson, where the main task is to activate the student’s cognitive abilities aimed at studying his ...

Protections of converting installations, made on the traditional element base

It is stated that the use of differential protection in converter installations allows, in comparison with maximum current protection, to increase speed and sensitivity. It is mentioned that over the past 20 years a number of new protection devices f...

Modeling the process of teaching a foreign language utterance using multimedia

The article reflects the main stages of modeling teaching utterance in multimedia context as a process of forming a speech action image. The properties of the information space are considered as the basis for image modeling, interactive components —...

Using English teaching applications in an EFL classroom for primary and secondary schoolchildren

This article discusses the idea of using English mobile and web applications in an EFL classroom to aid pupils to increase their productivity and learning process. Various applications to use in the learning process and at home as a self-instrument f...

Features of technical translation

In the presented scientific article the main aspects of technical translation are described. Particular attention is paid to its methods (automated and manual). Four main stages of technical translation are given, the translation of special terms is ...

Forming the phonetic competence in a foreign language at the secondary schools

This article considers about forming the phonetic competence in a foreign language at the secondary schools by the help of various types of exercises. Here are given the basic requirements for improving the pronunciation skills either.

Задать вопрос