loading...
خانه دانلود ایرانیان
آندرس اینیستا بازدید : 35 دوشنبه 22 آبان 1391 نظرات (0)

 

Here are some key features of "DataPreparator":

 

General:

· Data access from text files, relational databases, and Excel workbooks

· Handling of large volumes of data (since data sets are not stored in the computer memory, with the exception of Excel workbooks and result sets of some databases where database drivers do not support data streaming)

· Stand alone tool, independent of any other tools

· User friendly graphical user interface

· Operator chaining to create sequences of preprocessing transformations (operator tree)

· Creating of model tree for test/execution data

Data cleaning:

· Character removal

· Text replacement

· Date conversion

رایتر DataPreparator 1.5

Attribute operators on columns in the data set:

· Delete/Move attributes

· Remove selected attributes

· Move selected attributes

· Discretize numeric attributes

· Equal width

· Equal frequency

· Equal frequency from grouped data

· Handle missing values

· Delete records containing missing values

· Remove attributes containing missing values

· Impute missing values

· Predict missing valuues from model (dependence tree, Naive Bayes model)

· Include missing value patterns

· Handle outliers

· Z-score method

· Box-plot method

· Numerate nominal attributes

· Create binary attributes

· Replace nominal values by indices

· Reduce number of labels

· Keep a specified number of most frequent labels and create a new label from the remaining labels.

· Scale numeric attributes

· Decimal

· Linear

· Hyperbolic tangent

· Soft-max

· Z-score

· Other transformations (log(x), 1/x, x2, x3)

· Select attributes

· Manual selection

· Mutual information selecttion

· Robust mutual information selection

 

Record operators on rows in the data set:

· Sampling (random, every k-th item, first-k)

· Select records by key

 

File Utilities that create new files:

· Create data sets

· Create missing values

· Append

· Balance

· Change names

· Merge

· Sort

 

Output:

· Statistics

· Table

· File

· Database

· Visualize

Visualize Numeric attributes:

· Bar chart, cumulative frequency chart

· Box plot (single, conditional)

· Histogtram (single, conditional, normalized, overlaid, histogram matrix)

· Lag plot

· Linear regression plot

· Normal-quantile plot

· Quantile plot

· Quantile-quantile plot

· Run sequence plot

· Scatter plot

Visualize Nominal (categorical) attributes:

· Bar chart, pie chart

· Pareto chart

· Stacked chart

Numeric and nominal attributes:

· Dependence tree

 

Tools:

· Create data sets from raw data

· Create samples from raw data

· Shuffle raw data

· Configure database drivers

 

DOWNLOAD

 

برچسب ها "data , database , write" ,
مطالب مرتبط
ارسال نظر برای این مطلب

کد امنیتی رفرش
اطلاعات کاربری
  • فراموشی رمز عبور؟
  • آرشیو
    آمار سایت
  • کل مطالب : 29
  • کل نظرات : 0
  • افراد آنلاین : 1
  • تعداد اعضا : 1
  • آی پی امروز : 19
  • آی پی دیروز : 0
  • بازدید امروز : 18
  • باردید دیروز : 0
  • گوگل امروز : 0
  • گوگل دیروز : 0
  • بازدید هفته : 18
  • بازدید ماه : 19
  • بازدید سال : 68
  • بازدید کلی : 12,237