Data and Information Retrieval Coursework

220 CT Coursework 2020/2021

 


This is an individual assignment. Please type everything neatly in an MS Word file and submit it to the Canvas system on time.

There are 5 tasks in this CW, and all are mandatory. Online submission is due no later than 23:59 in the last week of semester. This CW represents 50% of your total module mark.

Question 1: Database Design (This task is worth 20 marks)

Introduction

 

The International Space Station (ISS) is a habitable artificial satellite in low Earth orbit. It is the ninth space station to be inhabited by crews, following previous orbital stations that were launched by the US, the former Soviet Union and later Russia. The ISS is intended to be a laboratory, observatory and factory in space, as well as to provide transportation and maintenance and to act as a staging base for possible future missions to the Moon, Mars and beyond. To support the crew and the overall operation of the ISS, the space agencies in charge of running the station conduct regular missions to launch spacecraft carrying payloads of essential or replacement equipment up to the ISS. A payload inventory (see the table below) is recorded for each mission, consisting of the space agency leading the mission and the equipment payload to be sent up to the ISS. The overall weight of the payload is also determined in order to calculate the fuel needed for orbital insertion, so that the spacecraft can successfully rendezvous with the ISS.

Mission No. | Agency No. | Lead Agency | Country | Mission Date | Equipment               | Qty | Item Weight | Total Weight
ISS-2237    | 178        | JAXA        | Japan   | 14/12/2013   | Potable water dispenser | 2   | 100kg       | 211kg
            |            |             |         |              | Flexible air duct       | 6   | 0.5kg       |
            |            |             |         |              | Small storage Rack      | 4   | 2kg         |
ISS-3664    | 526        | ESA         | EU      | 16/01/2014   | Bio filter              | 6   | 0.20kg      | 1.20kg
ISS-2356    | 167        | NASA        | USA     | 12/02/2014   | Small storage Rack      | 3   | 2kg         | 69kg
            |            |             |         |              | Battery pack            | 2   | 5kg         |
            |            |             |         |              | Urine transfer tubing   | 2   | 1.5kg       |
            |            |             |         |              | O2 scrubber             | 1   | 50kg        |
ISS-1234    | 032        | Roskosmos   | Russia  | 16/04/2014   | Small storage Rack      | 1   | 2kg         | 3kg
            |            |             |         |              | Flexible air duct       | 2   | 0.5kg       |
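
The Total Weight column appears to be the sum of Qty x Item Weight over each mission's items. The following is a minimal Python sketch that checks this reading against the figures in the table; the variable and function names are illustrative assumptions and are not part of the brief.

```python
# Payload rows copied from the inventory table:
# (mission no., equipment, quantity, item weight in kg).
payload = [
    ("ISS-2237", "Potable water dispenser", 2, 100.0),
    ("ISS-2237", "Flexible air duct", 6, 0.5),
    ("ISS-2237", "Small storage Rack", 4, 2.0),
    ("ISS-3664", "Bio filter", 6, 0.20),
    ("ISS-2356", "Small storage Rack", 3, 2.0),
    ("ISS-2356", "Battery pack", 2, 5.0),
    ("ISS-2356", "Urine transfer tubing", 2, 1.5),
    ("ISS-2356", "O2 scrubber", 1, 50.0),
    ("ISS-1234", "Small storage Rack", 1, 2.0),
    ("ISS-1234", "Flexible air duct", 2, 0.5),
]


def total_weights(rows):
    """Sum quantity * item weight for each mission."""
    totals = {}
    for mission, _equipment, qty, item_kg in rows:
        totals[mission] = totals.get(mission, 0.0) + qty * item_kg
    return totals


totals = total_weights(payload)
for mission, kg in totals.items():
    print(f"{mission}: {kg:g}kg")
```

Because the total can be recomputed from the item rows in this way, it is a derived value, which is worth bearing in mind when deciding which attributes to store in the normalized relations.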

Currently there is no database being used for managing the payload inventory information in the table above. It is therefore necessary to convert the inventory table into a set of database relations by applying and showing the process of normalization to determine the correct relations.

You are required to normalize the current data inventory to third normal form to produce appropriate relations (tables) for the database, showing the steps from UNF through 1NF and 2NF to 3NF. You must also specify the primary and foreign keys for the 3NF tables. You will then implement the database using a database system of your choice (MySQL recommended). A database creation script, including the record contents, must be submitted too. The data in the database must be retrievable by at least one simple query by mission no. The query script and a screen dump of its execution are expected as evidence of successful query execution.

Requirements:

Activity 1: Put data in First Normal Form: Remove Repeating Elements or Groups of Elements in Data (3 marks)

Activity 2:  Put data in Second Normal Form:  Remove Partial Dependencies on a Concatenated Key in Data (3 marks)

Activity 3: Put data in Third Normal Form: Remove Dependencies on Non-Key Attributes / Final Database Design (4 marks)

Database creation script with integrity control (i.e. foreign keys): 7 marks

Data insertion / preparation script: 1.5 marks

Query script: 1.5 marks
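
To illustrate what a creation script with integrity control, a data insertion script and a query by mission no. might look like, here is a minimal sketch. It makes several assumptions that are not in the brief: it uses Python's built-in sqlite3 module instead of the recommended MySQL so that it runs without a server, it shows only one possible decomposition (the table and column names are hypothetical), it loads only mission ISS-1234, and it stores the date in ISO format. It is not a model answer for the normalization activities.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite enforces foreign keys only when this pragma is switched on.
conn.execute("PRAGMA foreign_keys = ON")

# Hypothetical relations; total weight is left out because it can be derived.
conn.executescript("""
CREATE TABLE agency (
    agency_no TEXT PRIMARY KEY,          -- TEXT keeps the leading zero in '032'
    name      TEXT NOT NULL,
    country   TEXT NOT NULL
);
CREATE TABLE mission (
    mission_no   TEXT PRIMARY KEY,
    mission_date TEXT NOT NULL,          -- ISO date, e.g. 2014-04-16
    agency_no    TEXT NOT NULL REFERENCES agency(agency_no)
);
CREATE TABLE equipment (
    equipment_id   INTEGER PRIMARY KEY,
    name           TEXT NOT NULL UNIQUE,
    item_weight_kg REAL NOT NULL
);
CREATE TABLE mission_item (
    mission_no   TEXT NOT NULL REFERENCES mission(mission_no),
    equipment_id INTEGER NOT NULL REFERENCES equipment(equipment_id),
    qty          INTEGER NOT NULL CHECK (qty > 0),
    PRIMARY KEY (mission_no, equipment_id)
);
""")

# Data insertion for one mission from the inventory table.
conn.execute("INSERT INTO agency VALUES ('032', 'Roskosmos', 'Russia')")
conn.execute("INSERT INTO mission VALUES ('ISS-1234', '2014-04-16', '032')")
conn.executemany(
    "INSERT INTO equipment (equipment_id, name, item_weight_kg) VALUES (?, ?, ?)",
    [(1, "Small storage Rack", 2.0), (2, "Flexible air duct", 0.5)],
)
conn.executemany(
    "INSERT INTO mission_item VALUES (?, ?, ?)",
    [("ISS-1234", 1, 1), ("ISS-1234", 2, 2)],
)

# Simple query by mission no.
rows = conn.execute(
    """
    SELECT e.name, mi.qty, e.item_weight_kg
    FROM mission_item AS mi
    JOIN equipment AS e ON e.equipment_id = mi.equipment_id
    WHERE mi.mission_no = ?
    ORDER BY e.equipment_id
    """,
    ("ISS-1234",),
).fetchall()
print(rows)

# Integrity check: a made-up mission that points at a non-existent agency is rejected.
try:
    conn.execute("INSERT INTO mission VALUES ('ISS-9999', '2014-05-01', '999')")
    fk_enforced = False
except sqlite3.IntegrityError:
    fk_enforced = True
print("foreign key enforced:", fk_enforced)
```

The same CREATE TABLE statements would need adjusting for MySQL (for example, column types and the FOREIGN KEY clause syntax), so treat the schema text as a starting point only.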

 

Question 2: Database and Information Retrieval Systems (This task is worth 20 marks)

 

  1. Discuss the differences between database systems and information retrieval systems in terms of their advantages and disadvantages. Give an example of a database system and of an information retrieval system to support your arguments (10 marks).

 

  2. Discuss the differences between data and information and their significance as a corporate resource in an OLTP environment (e.g. a retailing business such as a supermarket). Give one example each of data and of information in that environment to support your arguments (10 marks).

 

 

 

Question 3: A data mining system for a bank (This task is worth 20 marks)

A bank has been collecting data on their customers and has heard that use of data mining could increase their competitiveness. They would like you to create a brief report that explores the following.

  1. What data mining is and an appropriate application for the bank (5 marks).
  2. How you would go about creating the system using the data mining lifecycle below (5 marks).
  3. The use of a data mining model such as a multilayer perceptron or decision tree to determine a person's creditworthiness using unseen data. Note, you will need to use a data mining tool like Google Colab / Weka to create your models and use the above dataset to train and test these models. Please provide your analysis source code and screen captures (5 marks).

 

 

 

Question 4: Your Big Data Idea (This task is worth 20 marks)

Identify and implement an idea that you have about how you would use Big Data for advanced analytics of retail chain stores, such as managing customer relationships, finding new customers, developing new products and services, or finding new trends that may facilitate understanding of some scientific or engineering areas. Produce a short report that includes what you have done in the following steps, your analysis results, data visualization and constructive comments / thoughts.

  1. Propose an idea and clearly outline it (e.g. what is the purpose of your data collection and analysis?) (2.5 marks).
  2. Acquire the Data. You can do that in many ways including using available public large data sets. (5 marks). Description of the data sets is expected with the attributes and availability details.
  3. Outline the ways to analyze the data in order to achieve the objective you set out for yourself in step 1. (2.5 marks) Using KDD with stepwise explanations would be expected. The description of data mining algorithms (e.g. clustering, association, etc.) would be expected too.
  4. Description of results to be visualized (2.5 marks).
  5. Constructive personal reflection (2.5 marks)
  6. Proper references and in-text citations (5 marks)

 

 

 

Question 5: Data Retrieval (This task is worth 20 marks)

Information Retrieval (IR) includes three main processes: Indexing, Searching and Ranking.

 

Refer to the following diagram, perform the tasks below:

 

(a) build an inverted index (indexing process) (12 marks),

(b) use the Boolean Retrieval Model to find the file(s) which contain the words ‘Java’ and ‘SQLDev’ (searching process) (4 marks),

(c) rank the results based on relevance. (ranking process) (4 marks)
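
The three processes can be sketched in a few lines of Python. The diagram referred to above is not reproduced here, so the file names and contents below are hypothetical stand-ins, and ranking by raw term frequency is an assumption, since the brief does not name a ranking scheme.

```python
from collections import defaultdict

# Hypothetical files standing in for the ones shown in the diagram.
docs = {
    "file1.txt": "Java SQLDev Java tutorial",
    "file2.txt": "Java programming basics",
    "file3.txt": "SQLDev and Java notes",
    "file4.txt": "Python scripting",
}


def build_inverted_index(documents):
    """Indexing: map each lowercased term to {file name: term frequency}."""
    index = defaultdict(dict)
    for name, text in documents.items():
        for term in text.lower().split():
            index[term][name] = index[term].get(name, 0) + 1
    return index


def boolean_and(index, terms):
    """Searching: return the files whose postings contain every query term."""
    postings = [set(index.get(t.lower(), {})) for t in terms]
    return set.intersection(*postings) if postings else set()


def rank(index, terms, matches):
    """Ranking: order matches by summed query-term frequency, highest first."""
    def score(name):
        return sum(index.get(t.lower(), {}).get(name, 0) for t in terms)
    # Ties are broken alphabetically so the order is deterministic.
    return sorted(matches, key=lambda name: (-score(name), name))


index = build_inverted_index(docs)
query = ["Java", "SQLDev"]
matches = boolean_and(index, query)
ranked = rank(index, query, matches)
print(ranked)
```

With these stand-in files, the Boolean AND keeps only the files containing both terms, and the file that mentions the query terms most often is listed first. Other ranking schemes, such as TF-IDF weighting, would slot into the `rank` function without changing the index.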

 

 

 
