Usage of the Bag Of Words Model for URL Roman Hujer . The main idea of the paper is to apply the bag-of-words method to URLs treated as strings. The input is a large list of URLs. These URLs are used to build a single bag of words together with the frequency of each word. The assumption is that high-frequency words may be recurring patterns in URLs or may suggest ways of classifying them.
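A minimal sketch of the idea, in Python. The abstract does not say how a URL is split into words, so the tokenizer here (lowercasing and splitting on every non-alphanumeric character) is an assumption, and the function names and example URLs are hypothetical.

```python
import re
from collections import Counter


def tokenize_url(url):
    """Split a URL into lowercase word tokens.

    Assumption: any run of non-alphanumeric characters is a separator.
    The paper may tokenize differently.
    """
    return [tok for tok in re.split(r"[^A-Za-z0-9]+", url.lower()) if tok]


def url_bag_of_words(urls):
    """Build one bag of words (word -> frequency) over all URLs."""
    bag = Counter()
    for url in urls:
        bag.update(tokenize_url(url))
    return bag


# Hypothetical input list; a real run would use a large quantity of URLs.
urls = [
    "http://example.com/news/2012/sport",
    "http://example.com/news/2012/politics",
    "https://shop.example.org/cart?item=42",
]

bag = url_bag_of_words(urls)

# Words sorted by frequency; the most frequent ones are pattern candidates.
for word, count in bag.most_common(5):
    print(word, count)
```

With a large input, scheme and top-level-domain tokens such as `http` or `com` are likely to dominate the counts, so a stop list for them may be worth considering before looking for patterns.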
TIMODAZ Michal Višňovský, Pavel Strnad . The article focuses on the analysis of data from the TIMODAZ project, which examines the effect of temperature on the concrete lining of a nuclear waste repository. It primarily presents the results of the methods used to remove outliers from the acquired data and describes how the missing values were filled in. Finally, the article evaluates the results and gives recommendations.
Algorithm for Missing Values Imputation in Categorical Data with Use of Association Rules Jiří Kaiser . The paper presents a new algorithm for imputing missing values in categorical data. The algorithm is based on association rules and is presented in three variants. Experiments show that the new algorithm imputes missing values more accurately than substituting the most common attribute value.
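The contrast between the two approaches can be sketched as follows. This is not the paper's algorithm and none of its three variants, which the abstract does not describe. It is a simplified illustration that assumes rules with a single antecedent (`attribute=value -> target=value`), a hypothetical `min_support` threshold, and a fallback to the most common value when no rule applies. The data set is invented.

```python
from collections import Counter, defaultdict

MISSING = None  # marker for a missing categorical value


def most_common_value(rows, target):
    """Baseline: the most frequent observed value of the target attribute."""
    counts = Counter(r[target] for r in rows if r[target] is not MISSING)
    return counts.most_common(1)[0][0]


def mine_rules(rows, target, min_support=2):
    """Mine single-antecedent rules (attr, value) -> (target value, confidence).

    Simplifying assumption: only one attribute on the left-hand side.
    """
    counts = defaultdict(Counter)
    for r in rows:
        if r[target] is MISSING:
            continue
        for attr, value in r.items():
            if attr != target and value is not MISSING:
                counts[(attr, value)][r[target]] += 1
    rules = {}
    for antecedent, target_counts in counts.items():
        value, support = target_counts.most_common(1)[0]
        if support >= min_support:
            confidence = support / sum(target_counts.values())
            rules[antecedent] = (value, confidence)
    return rules


def impute(rows, target, min_support=2):
    """Fill missing target values with the highest-confidence matching rule."""
    fallback = most_common_value(rows, target)
    rules = mine_rules(rows, target, min_support)
    result = []
    for r in rows:
        r = dict(r)  # copy so the input rows stay unchanged
        if r[target] is MISSING:
            candidates = [rules[(a, v)] for a, v in r.items()
                          if a != target and (a, v) in rules]
            if candidates:
                r[target] = max(candidates, key=lambda c: c[1])[0]
            else:
                r[target] = fallback  # no rule applies
        result.append(r)
    return result


# Invented example data.
rows = [
    {"outlook": "sunny", "windy": "no", "play": "yes"},
    {"outlook": "sunny", "windy": "no", "play": "yes"},
    {"outlook": "rainy", "windy": "yes", "play": "no"},
    {"outlook": "rainy", "windy": "yes", "play": "no"},
    {"outlook": "rainy", "windy": "no", "play": "no"},
    {"outlook": "sunny", "windy": "no", "play": MISSING},
    {"outlook": "cloudy", "windy": MISSING, "play": MISSING},
]

imputed = impute(rows, "play")
```

In this toy data the most common value of `play` is `no`, but the record with `outlook=sunny` is filled with `yes` because the rule `outlook=sunny -> play=yes` holds for every complete row it matches. The last record matches no rule and falls back to the baseline.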
