Science Quiz / CPS844 - Data Mining

Random Science or Clickable Quiz

Data Mining Midterm Review

Quiz not verified by Sporcle

Challenge
Share
Tweet
Embed
Score 0/93 Timer 20:00
QuestionAnswerHint
What are some examples of uses in DM & ML?
What is Data Mining?
What is the goal of Data Mining?
What is Machine Learning?
What are the goals of patterns?
What are the different types of patterns?
What are some examples of structural patterns?
Given some .arff data file, what are its instances?
Example = Instance (T/F?)
Given the contact-lens.arff data file, what are some of the attributes?
Given the contact-lens.arff data file, what are its class attributes?
How do you set the class in weka explorer?
What is learning?
How do Machine Learning algorithms work?
Feature = Attribute (T/F?)
Value of Attribute?
What are association rules?
What is tenfold stratified cross-validation?
What is stratified?
Why not use the training set for the testing data?
What is pruning?
What are the advantages of pruning?
Will the accuracy be better on new data?
In decision trees, what is one way to avoid overfitting?
What is Naive Bayes' rule of conditional probability?
What is the prior prbability of the hypothesis?
What if you know the distribution for an attribute is abnormal, and follows the probability density function?
What adjusts the priori probabilities?
What can the priori probabilities be set to?
What is the laplace estimator set to?
What if you don't wnat to guess the PDF, what would you do for the attribute temperature?
QuestionAnswerHint
What is the Divide-and-Conquer Decision Tree?
If antecedent then X? (What is X?)
Decision trees can be easily converted to what kind've set?
What are some advantages of rules over decision trees?
What are some disadvantages of rules over decision trees?
In a multiclass data file, what is generated by machine learning algorithms?
What is a decision list?
In a multiclass data file, is modularlity preserved?
What are the top languages used for analytics, data mining, and data science?
What is the highest level analytitic and data mining software used in the past 12 months on a real project?
What type of model/rule works best with nominal attributes?
What type of model/rule works best with numeric attributes?
What is a linear model?Instead of doing subscripts just write the number beside the variable.
What is the key characteristic that makes linear model equations?
What is linear regression?
What is linear regression best applied to?
How can we use linear regression to predict classes?
What are the advantages of using Linear Classification via Regression?
What are the disadvantages of using Linear Classification via Regression?
What is multiresponse linear regression dependent on?
What are the different ways to apply linear classification?
How does logistic regression differ from linear regression?
How does perceptron differ from the other types of linear classification?
What is one way distance can be measured (Given an instance a)?
What is distance given the attribute-value of a nominal instance?
What is the distance given the attribute-value is missing?
How can we find the nearest neighbours effeciently?
What is the complexity of a straight-forward search?
What is a disadvantage of the kD-tree?
How can we correct the disadvantage of the kD-tree?
What are some advantages of nearest-neighbour instance-based learning?
QuestionAnswerHint
What are some disadvantages of nearest-neighbour instance-based learning?
What are some variations of the ball tree?
What are the four types of clusters?
What is a clique?
This research is based on recommendingDiscovering unknown but interesting items on personal social network
In this research User Similarity Score representsDiscovering unknown but interesting items on personal social network
As per this researchDiscovering unknown but interesting items on personal social network
One of the concerns that related to the sustainability problem isData Mining Solutions for Sustainability Problems
One of the good steps towards the green computing goal isData Mining Solutions for Sustainability Problems
The low server utilization is a problem associated withData Mining Solutions for Sustainability Problems
Why time series data-sets must be reduced (approximated)?Data mining of time series
What is the main problem of DFT (Discrete Fourier Transform) reduction family?Data mining of time series
What is the difference between clustering (pattern discovery) and classification tasks?Data mining of time series
Which is the most effective way to maintain cache consistency in mobile environments?Data Mining Cache in Mobile Networks
After generating caching rules, any rule that meets the following condition is added to the rule setData Mining Cache in Mobile Networks
Which concept does SACCS use to track changes to data objects in cache?Data Mining Cache in Mobile Networks
What method is used for determining cyber criminals?Authorship Identification for Cyber Crime
What is “Write-print”?Authorship Identification for Cyber Crime
How is author determined by Authorship Identification in this presentation?Authorship Identification for Cyber Crime
____ is a system that provides a recommendation, prediction, opinion, or user-configured list of items that assists the user” is a definition ofThe Application of Data-Mining to Recommender Systems
Content-based recommender system that incorporate information retrieval methods are frequently used to satisfy ephemeral needs fromThe Application of Data-Mining to Recommender Systems
Hamlet system is designed to minimize the purchase price of airplane tickets by incorporate time into the recommendation process is an example ofThe Application of Data-Mining to Recommender Systems
The general design principle of Hadoop is built onOn Using Pattern Matching Algorithms in MapReduce Applications
The two main phases of Pattern Matching areOn Using Pattern Matching Algorithms in MapReduce Applications
The theory of “Speaker Recognition” is based onOn Using Pattern Matching Algorithms in MapReduce Applications
Are considered nonlinear statistical data modeling tools where the complex relationships between inputs and outputs are modeled or patterns are found.Data Mining Using ANN for Finding the Effects of Building Structure on Thermal Comfort Parameters.
Is the temperature, humidity and other conditions which occupants experience in a building and dependent on human activities, insulation material, mean radiant temperature, humiditData Mining Using ANN for Finding the Effects of Building Structure on Thermal Comfort Parameters.
Are the physical distance between the sensor and nearest structural property?Data Mining Using ANN for Finding the Effects of Building Structure on Thermal Comfort Parameters.
What was the purpose of the research?Mining Incomplete Survey Data Through Classification
Why did the study use Linear Discriminant Analysis (LDA)?Mining Incomplete Survey Data Through Classification
What are some of the approaches for handling missing data?Mining Incomplete Survey Data Through Classification

You're not logged in!

Compare scores with friends on all Sporcle quizzes.
Sign Up with Email
OR
Log In

You Might Also Like...

Show Comments

Extras

Top Quizzes Today


Score Distribution

Your Account Isn't Verified!

In order to create a playlist on Sporcle, you need to verify the email address you used during registration. Go to your Sporcle Settings to finish the process.