Premium Essay

Data Warehousing and Data Mining

In: Computers and Technology

Submitted By hajas
Words 8284
Pages 34
Table of Contents Introduction 2 Assumptions 3 Data Availability 3 Overnight processing window 3 Business sponsor 4 Source system knowledge 4 Significance 5 Data warehouse 6 ETL: (Extract, Transform, Load) 6 Data Mining 6 Data Mining Techniques 7 Data Warehousing 8 Data Mining 8 Technology in Health Care 9 Diseases Analysis 9 Treatment strategies 9 Healthcare Resource Management 10 Customer Relationship Management 10 Recommended Solution 11 Corporate Solution 11 Technological Solution 11 Justification and Conclusion 12 References 14 Health Authority Data (Appendix A) 16 Data Warehousing Implementation (Appendix B) 19 Data Mining Implementation (Appendix B) 22 Technological Scenarios in Health Authorities (Appendix C) 26 Technology Tools 27

Data Management Technology
Introduction

The amount of information offered to us is literally astonishing, and the worthiness of data as an organizational asset is widely acknowledged. Nonetheless the failure to manage this enormous amount of data, and to swiftly acquire the information that is relevant to any particular question, as the volume of information rises, demonstrates to be a distraction and a liability, rather than an asset. This paradox energies the need for increasingly powerful and flexible data management systems. To achieve efficiency and a great level of productivity out of large and complex datasets, operators need have tools that streamline the tasks of managing the data and extracting valuable information in a timely fashion. Fading to do so may incur an elevation cost of attaining it and managing it which may exceed the worth that was derived from it.

Data Analytics Systems have been used intensively and extensively by numerous industries. Data mining techniques are pleasant increasingly widespread and ever more essential in the areas of…...

Similar Documents

Premium Essay

Data Mining

...A data warehouse is a subject-oriented, integrated, time-variant, nonupdateable collection of data used to support management decision-making and business intelligence (Hoffer, 2011). Business Intelligence (BI) is a term that describes a comprehensive, cohesive, and integrated set of tools and processes used to capture, collect, integrate, store and analyze data with the purpose of generating and presenting information to support decision making (Coronel, Morris, & Rob, 2013). Data Warehouse A data warehouse enables an organization to obtain the information about future trends and track customer demands. The key terms that define data warehouse are subject-oriented, integrated, time-variant, and nonupdateable. Each one has its meaning and importance in data warehousing. Subject-oriented – A data warehouse is organized around the key subjects that may include but not limited to customers, patients, students, products, and time. Integrated – The data in the data warehouse are defined using consistent naming conventions, formats, structures, and related characteristics. This means data warehouse holds one version of “the truth”. Time-variant – Data in the data warehouse contain a time dimension so they could be used to study trends and changes. Nonupdateable – Once the data gets loaded into the data warehouse, it could not be updated by the end users. Data warehousing is a process where organizations create and maintain data warehouses and extract......

Words: 1390 - Pages: 6

Premium Essay

Data Mining

...Data Mining: What is Data Mining? Overview Generally, data mining (sometimes called data or knowledge discovery) is the process of analyzing data from different perspectives and summarizing it into useful information - information that can be used to increase revenue, cuts costs, or both. Data mining software is one of a number of analytical tools for analyzing data. It allows users to analyze data from many different dimensions or angles, categorize it, and summarize the relationships identified. Technically, data mining is the process of finding correlations or patterns among dozens of fields in large relational databases. Continuous Innovation Although data mining is a relatively new term, the technology is not. Companies have used powerful computers to sift through volumes of supermarket scanner data and analyze market research reports for years. However, continuous innovations in computer processing power, disk storage, and statistical software are dramatically increasing the accuracy of analysis while driving down the cost. Example For example, one Midwest grocery chain used the data mining capacity of Oracle software to analyze local buying patterns. They discovered that when men bought diapers on Thursdays and Saturdays, they also tended to buy beer. Further analysis showed that these shoppers typically did their weekly grocery shopping on Saturdays. On Thursdays, however, they only bought a few items. The retailer concluded that they purchased the beer to......

Words: 1657 - Pages: 7

Premium Essay

Data Mining

...Data Mining Objectives: Highlight the characteristics of Data mining Operations, Techniques and Tools. A Brief Overview Online Analytical Processing (OLAP): OLAP is the dynamic synthesis, analysis, and consolidation of large volumns of multi-dimensional data. Multi-dimensional OLAP support common analyst operations, such as: ▪ Considation – aggregate of data, e.g. roll-ups from branches to regions. ▪ Drill-down – showing details, just the reverse of considation. ▪ Slicing and dicing – pivoting. Looking at the data from different viewpoints. E.g. X, Y, Z axis as salesman, Nth quarter and products, or region, Nth quarter and products. A Brief Overview Data Mining: Construct an advanced architecture for storing information in a multi-dimension data warehouse is just the first step to evolve from traditional DBMS. To realize the value of a data warehouse, it is necessary to extract the knowledge hidden within the warehouse. Unlike OLAP, which reveal patterns that are known in advance, Data Mining uses the machine learning techniques to find hidden relationships within data. So Data Mining is to ▪ Analyse data, ▪ Use software techniques ▪ Finding hidden and unexpected patterns and relationships in sets of data. Examples of Data Mining Applications: ▪ Identifying potential credit card customer groups ▪ Identifying buying patterns of customers. ▪ Predicting trends of......

Words: 1258 - Pages: 6

Premium Essay

Data Warehousing

...head: SHORT TITLE OF PAPER (<= 50 CHARACTERS) Data Warehousing and Data Mining Bruce Nimo CIS 111 March 19, 2012. Prof Jones Data mining is a process of numerical analysis. Analysts use technical tools to query and sort through terabytes of data looking for patterns. Usually, the analyst will develop a hypothesis, such as customers who buy product X usually buy product Y within six months. Running a query on the relevant data to prove or disprove this theory is data mining. Data warehousing describes the process of designing how the data is stored in order to improve reporting and analysis. Data warehouse experts consider that the various stores of data are connected and related to each other conceptually as well as physically. A business's data is usually stored across a number of databases. However, to be able to analyze the broadest range of data, each of these databases needs to be connected in some way. This means that the data within them need a way of being related to other relevant data and that the physical databases themselves have a connection so their data can be looked at together for reporting purposes. Businesses then use this information to make better business decisions based on how they understand their customers' and suppliers' behaviors. Examples of businesses that use data warehousing and data mining are amason.com, Wal-Mart stores Inc etc. Both data mining and data warehousing are business intelligence tools that are used to turn......

Words: 1181 - Pages: 5

Premium Essay

Data Warehouses and Data Mining

...Data Warehouses and Data Mining Your Name DBM 384 May 13, 2013 Jim Cervi Data Warehouses and Data Mining Data warehouses serve an integral function within many different industries. In the government and law enforcement agencies this is especially prevalent. Vast amount of data and information from multiple sources is often collected by these agencies. This data and information must be put into a format that allows for workable details by the analysts (HowStuffWorks.com, 2012). Data mining and data warehouses provide these agencies with the ability to select specific data out of the large volumes of data available to the analyst Data Warehouse A data warehouse is a database of information collected from several resources, saved under a specific schema, at only one site according to (Siberschatz, Korth, & Sudarshan, 2011). This type of system is effective for government intelligence agencies in storing and categorizing the data sources. By effectively categorizing and storing the data, the data warehouse provides the analyst with a location where an effective query can produce tailored and specific results from vast stores of records. The data warehouse does this by linking the data sources through common threads. These threads are what allow the analyst to access the correct related information through the query. The data warehouse provides the structure of the data sources that the information will be categorized in. To be truly effective, a well-designed......

Words: 575 - Pages: 3

Premium Essay

Data Mining

...Data Mining/Data Warehousing Matthew P Bartman Strayer University Ibrahim Elhag CIS 111– Intro to Relational Database Management June 9, 2013 Data Mining/Data Warehousing When it comes to technology especially in terms of storing data there are two ways that it can be done and that is through data mining and data warehousing. With each type of storage there are trends and benefits. In terms of data warehousing there are 5 key benefits one of them being that it enhance business intelligence. What this means is that business processes can be applied directly instead of things having to be done with limited information or on gut instinct. Another benefit of data warehousing is that it can also save time meaning that if a decision has to be made the data can be retrieved quickly instead of having to find data from multiple sources. Not only does data warehousing enhance business intelligence and save time but it can also enchance data quality and consistency.This is accomplished by converting all data into one common format and will make it consistent with all departments which ensures accuracy with the data as well. While these key benefits another one is that it can provide historical intelligence which means that analayze different time periods and trends to make future predictions. One other key benefit is that it provides a great return on investment. The reason being that a data warehouse generates more......

Words: 2018 - Pages: 9

Premium Essay

Data Mining and Data Warehousing

...Data mining and warehousing and its importance in the organization * Data Mining Data mining is the process of analyzing data from different perspectives and summarizing it into useful information - information that can be used to increase revenue, cuts costs, or both. Data mining software is one of a number of analytical tools for analyzing data. It allows users to analyze data from many different dimensions or angles, categorize it, and summarize the relationships identified. Technically, data mining is the process of finding correlations or patterns among dozens of fields in large relational databases. Data mining is primarily used today by companies with a strong consumer focus - retail, financial, communication, and marketing organizations. It enables these companies to determine relationships among internal factors such as price, product positioning, or staff skills, and external factors such as economic indicators, competition, and customer demographics. And, it enables them to determine the impact on sales, customer satisfaction, and corporate profits. Finally, it enables them to drill down into summary information to view detail transactional data. For example, “Entertainers Incorporated” is an organization which deals with entertainers for events. So the need to attract customers and communicating with them is essential. Customer satisfaction in their service is much needed for them, for the customers to approach them for the next event too. So considering......

Words: 1344 - Pages: 6

Premium Essay

Data Warehousing

...Data warehousing: what is it, why do we want to do it, how is it done? How do data warehouses compare with operational databases? What do we want to consider when doing so, and what are our options in terms of implementation? In this article, I discuss data warehouses: what they are, how they compare to operational databases, and how they are designed, implemented and maintained. An operational database is one which is used by the enterprise to run its day to day operations. They are created to support fast transaction processing, with frequent updates. Speed is key to operational databases. They typically are used by clerical staff, and are on the order of megabytes of data to gigabytes. Database consistency is very important to operational databases, and consistency checks and constraints are rigidly enforced. They contain the most current set of data applicable to running enterprise operations. These are our sales and inventory databases. A data warehouse differs from this in many ways. They are used by management for making decisions, watching trends, and running reports. They are typically used offline, have few users and are very large: gigabytes to terabytes. They contain historical data, are read only, and are added to but rarely or never updated (the rows in the database are not changed, I mean). The data in the data warehouse is time sensitive – each row is the warehouse is timestamped so that trending of data versus time can be done. The kinds...

Words: 1530 - Pages: 7

Premium Essay

Data Mining

...Many organizations use data from sources such as, card readers, bar code scanners, and points of sales (Anderson, Sweeney, and Williams 17). The data is stored and kept for future usage in analyses and studies; this process is called data warehousing (17). Technological advances allow preserving and recovering great volumes of data for upcoming analyses by using warehouse data (17). All this information is critical when managers are taking decisions involving uncertainty (17). Data mining is a very important tool in making decisions because it implements a set of processes from different fields that work together to ease the way of obtaining information from large databases (17). However, it is not enough to have all the data condensed; it is necessary to obtain information that is understandable and appropriate to make decisions (17). In order to create new strategies or formulate new programs to improve the business, decision makers should use data mining systems, which allow automatic processes to find predictive information useful to choose right decisions (17). Another important feature of data mining as a decision making tool is the effectiveness of it to develop predictive models (18). Nevertheless, decision makers should be careful when developing these kinds of models because sometimes it is not reliable and may be misleading if it is constructed using too many parameters (18). These problems can be avoided testing the accuracy of the model (18). Works Cited......

Words: 258 - Pages: 2

Premium Essay

Data Mining

...Data Mining In today’s society there is more data and information that is being stored it is hard to sort through it all and pick out useful information. All of this information is being stored in data warehouses which are basically a huge database filled with information collected over many years. This is why data mining is so popular, it has the ability to search through these huge amounts of data and find useful information, or will be able to see patterns in the information that may have not been noticed any other way. Also data mining tools can help organize data to clarify any patterns found, this allows for more accurate and reliable findings. There are many different types of patterns that can be found with data mining that could be useful to stores, such as what products are often sold together or what products are sold best in cretin areas. There are some different tools used when data mining some examples would be classification tools, clustering analysis tools and association rules discovery. Before these tools for data mining were used the way people used to find information was with hypothesis verification, this technique would find information based on what the user’s hypothesis which limited how effective this tool was (Ramsey). Classification tools are the most commonly used in data mining, these tools have the ability to try and difference between objects or actions. An example of this would be if a company wants to know what part of their advertisements...

Words: 850 - Pages: 4

Premium Essay

Data Warehousing and Data Mining

...According to Lee, the most popular definition is a data warehouse is a subject-oriented, integrated, time-variant and non-volatile collection of data in support of management’s decision making process (2014). Basically a data warehouse is a copy of transaction data specifically structured for query and analysis. According to Frand, data mining (sometimes called data or knowledge discovery) is the process of analyzing data from different perspectives and summarizing it into useful information – information that can be used to increase revenue, cut costs, or both (1997). There are many benefits of data warehousing. Yes, it will cost large amounts of money from businesses to have a data warehouse but, in the long run it is worth it to have in a corporation. One benefit is that data warehouses stores and presents information in a way that allows management to make important decisions (Prathap, 2014). Management and even executives can look at the business as a whole instead of by each department. According to Prathap, another benefit of data warehouses is their ability to handle server tasks connected to querying which is not used in most transaction systems (2014). Creating queries and reports can take time and with data warehousing, the server can handle the tasks in a timely fashion. Again, according to Prathap, one of the most important benefits of data warehouses is that they set the stage for an environment where a small amount of technical knowledge about databases...

Words: 1726 - Pages: 7

Premium Essay

Data Warehousing and Data Mining

...Term Paper Data Warehousing and Data Mining Term Paper Data Warehousing and Data Mining A Data Warehouse Saves Time, Enhances Data Quality and Consistency, Provides Historical Intelligence, Generates a High return on investment. One of the benefits to using a data warehouse is that it conveys Business Intelligence By providing data from various sources. Management executives no longer need to make decisions based on finite data. Time is saved by allowing business users to get data quickly from one location so they can make quick knowledgeable decisions and will not waste time trying to get data from several places. Using a data warehouse implementation provides for better data quality and consistency by converting data from different sources into an individual common format. Since data from each department is standardized, each department will have results that are in alignment with the results of all other departments. Doing so assures the accuracy of data and that provides for better business decisions. Since a data warehouse holds large volumes of historical data, it can analyze trends over a time frame so that predictions can be made. This type of data usually cannot be kept or used to get reports in a transactional database. Finally the return on investment for companies instituting a data warehouse and a Business Intelligence system generate more money while also saving money. The purpose of data mining is to provide knowledge to give company an......

Words: 1217 - Pages: 5

Premium Essay

Data Warehousing

...Training Curriculum Data Warehousing: • Introduction to Business Objects Enterprise Reporting • Fundamentals of Data warehouse Concepts • Introduction to Dimensional Modeling • Developing a Star Schema Reporting: • Building and editing queries with Web Intelligence • Performing on report analysis with Web Intelligence • Filtering Queries using conditions, prompts etc., • Using Combined Queries and merging dimensions • Displaying data in various formats (Ex: Tables, Charts etc.,) Advanced Reporting: • Calculations, Formulas and variables • Ranking Data, using Alerters to highlight data, Formatting numbers and Dates • Understanding Calculation Contexts • Web Intelligence Functions, Operators and Keywords • Calculating values with Smart Measures Universe Designer: • Designer and Universe Fundamentals • Creating a schema with Tables and Joins • Resolving Join problems in a schema • Defining Classes, Objects, hierarchies, using cascading list of values for hierarchies • Testing the universe • Working with OLAP universes Xcelsius 2008: • Application Overview • Creating and Updating Xcelsius visualizations • Using Xcelsius components ( Chart, Containers, Selectors etc.,) • Exporting Xcelsius visualizations to various applications (Power point, PDF, Flash • Creating templates, Alerts and Dynamic visibility • Using Data Manager ( Creating and configuring connections) • Live Office Connections, Query As A Web Service (QWAAS), XML data......

Words: 462 - Pages: 2

Premium Essay

Data Warehousing

...Data Warehouses The basic reasons organizations implement data warehouses are: To perform server/disk bound tasks associated with querying and reporting on servers/disks not used by transaction processing systems most firms want to set up transaction processing systems so there is a high probability that transactions will be completed in what is judged to be an acceptable amount of time. Reports and queries, which can require a much greater range of limited server/disk resources than transaction processing, run on the servers/disks used by transaction processing systems can lower the probability that transactions complete in an acceptable amount of time. Or, running queries and reports, with their variable resource requirements, on the servers/disks used by transaction processing systems can make it quite complex to manage servers/disks so there is a high enough probability that acceptable response time can be achieved. Firms therefore may find that the least expensive and/or most organizationally expeditious way to obtain high probability of acceptable transaction processing response time is to implement a data warehousing architecture that uses separate servers/disks for some querying and reporting. To use data models and/or server technologies that speed up querying and reporting and that are not appropriate for transaction processing There are ways of modeling data that usually speed up querying and reporting (e.g., a star schema) and may not be appropriate for......

Words: 1160 - Pages: 5

Premium Essay

Data Warehousing

...Data Warehousing Saikrishna Burugula IST 7000 Data Management Wilmington University Abstract A data storage could be a subject-oriented, integrated, time-variant, non-updateable assortment of knowledge utilized in business intelligence and support of organizing decision-making method (Inmon, Strauss & Neushloss, 2008). In data warehousing when the data is stored it is not updated, commonly data warehousing intended for evaluation connected with data source in addition to addressing queries it can be called copy of addressing data (Prabhu, 2002). The key intention with this paper is typically to target on the actual design connected with data warehouse in addition to modeling techniques like ER modeling and Dimensional modeling. Introduction A Data Warehouse is not just a new combination of all of the in business databases in an organization. Because of its attention on business intelligence, exterior data, and time variant information, a data warehouse is usually a special type of database. The good thing is, you should not learn another number of database abilities to do business with a new information storage place. Most data warehouses tend to be relational databases designed in many ways optimized pertaining to selection assistance, definitely not in business information running. Facts warehousing could be the procedure whereby organizations create and gaze after information...

Words: 1513 - Pages: 7