Premium Essay

"Enterprise Level Data Work Flows and Data Warehouse

In: Computers and Technology

Submitted By iahammod3175
Words 6349
Pages 26
CSCI 1507 (1903)

"Enterprise level data work flows and Data Warehousing"

Professor Rajni Palikhey
University of Northern Virginia

Acknowledgement

This Research Paper would not have been possible without the guidance and the help of my co-students and respected Professor who in one way or the other contributed and extended their valuable assistance in the preparation and completion of this research paper. I would to like to convey my sense of gratitude to Professor.Rajni Palikhey who helped and supported us right throughout the semester. This paper would not have been possible without her cooperation and technical assistance. We would also thank our Institution and our faculty members without whom this project would have been a distant reality. We also extend our heartfelt thanks to our family and well wishers. I would like to take this occasion to specially thank University of Northern Virginia to provide us with excellent faculty and also in supporting us getting quality education remotely.

Contents

SL No Title Page no
1 Abstract 5
2 Introduction to Databases 6
3 OLTP and OLAP Systems 7
4 Difference between OLTP and OLAP 9
5 Data Modeling 13
6 Workflows in Enterprise level Data warehousing 18
7 Business Intelligence tools used in Data flow and Data Warehousing 21
8 Analysis in Data warehousing 24
9 Conclusion 28
10 Foot Note 30
11 References 31

ABSTRACT
These days majority of the applications, may it be web applications or windows applications or mobile applications, are completely database dependent. Most of the application developments are becoming database driven environments, hence rendering databases as one of the most key elements in a software environment. This dependency on databases can attributed to the increasing number of data requirements from the…...

Similar Documents

Free Essay

Data Warehouses

...For this paper I will be writing about my wife’s place of work because in my line of work right now there is not much use for a database. I am a painter and my employer does not use or need a lot of storage space. In my wife’s line of work they have huge amounts of data that is stored in a data warehouse that is used for the company’s employees. The employees will pull data that is needed each day to do their daily tasks. The company handles with several different parts that make up one part. Each part is labeled and has a barcode so that the part can be identified. The system that holds all of the data is basically a data warehouse that when an employee needs certain information about a single part can be retrieved from the data warehouse. The data warehouse is also used to follow the vendors to see how well they are doing. What I mean by see how they are doing is to follow how many good parts they build versus the bad parts that are built and sent to the company. The data that is recorded when a bad part is found leads to an inspection of the parts. When the inspection starts there are required steps that are followed and entered into the database. When this data is entered into the database it can show how many parts that have been inspected and how many were good and how many bad. With this company’s regular database that is being used every day and the data warehouse being used to see how the parts are being handle and how the...

Words: 473 - Pages: 2

Premium Essay

Data Warehouses and Data Mining

...Data Warehouses and Data Mining Your Name DBM 384 May 13, 2013 Jim Cervi Data Warehouses and Data Mining Data warehouses serve an integral function within many different industries. In the government and law enforcement agencies this is especially prevalent. Vast amount of data and information from multiple sources is often collected by these agencies. This data and information must be put into a format that allows for workable details by the analysts (HowStuffWorks.com, 2012). Data mining and data warehouses provide these agencies with the ability to select specific data out of the large volumes of data available to the analyst Data Warehouse A data warehouse is a database of information collected from several resources, saved under a specific schema, at only one site according to (Siberschatz, Korth, & Sudarshan, 2011). This type of system is effective for government intelligence agencies in storing and categorizing the data sources. By effectively categorizing and storing the data, the data warehouse provides the analyst with a location where an effective query can produce tailored and specific results from vast stores of records. The data warehouse does this by linking the data sources through common threads. These threads are what allow the analyst to access the correct related information through the query. The data warehouse provides the structure of the data sources that the information will be categorized in. To be truly effective, a well-designed......

Words: 575 - Pages: 3

Premium Essay

Creating a Data Warehouse

... Creating a Data Warehouse Introduction Data warehouses are the latest buzz in the business world. Not only are they used to store data for reporting and forecasting, but they are part of a decision support system. There are many reasons for creating and using a data warehouse. The data warehouse will support the decisions a business needs to make, usually on a daily basis. The data warehouse collects data, consolidates the data for reporting purposes. Data warehouses are accompanied by analytical tools that accommodate forecasting as part of the decision support system. The purpose of this paper is to explore the creation of a data warehouse. Since the specifics of creating the data warehouse are determined by the database system, this paper will devote its discussion to the design or layout of the data warehouse. Before discussion of the layout of the data ware house proceeds, the basics about a data warehouse need to be discussed. Then the elements of the data ware house will be covered. What is a Data Warehouse? A data warehouse is a warehouse full of data, an electronic warehouse. In a manner of speaking this is true. Don Awalt describes it as follows, “A data warehouse is the cohesive data model that defines the central data repository for an organization. “ He also further stated that “we consider it a complete, integrated data model of the enterprise, regardless of how of where the data is stored.” Thus we can see that the data warehouse collects and......

Words: 3953 - Pages: 16

Premium Essay

Data Warehouse

...how can data warehousing, data mining & predictive analytics improve a business. Would it be applicable to all types of business or a particular business only? Information Technology develops rapidly, because changes in these technologies are making the people’s lives easier. There’s a growing need for information in market and the competition of handling information. Some businesses needs to improve their ability and capability to handle big data or information. Data warehousing evolved and plays a big or essential role in the storage, information management and to support strategic reporting and analytics of companies. Businesses are investing to integrate their daily operations to be contained in their data warehouse. Businesses aims for a growth to their competitive advantage compared to other organizations. Some of these competitive advantages includes data warehousing, data mining and predictive analytics to be applied with effective use of Information Technology. Data warehouse is designed to support decision making for leaders or owners of an organization. Data warehouse is truly important for which it gives or share all data by every department of an organization that allows decision making in order to achieve good analysis which will help better the organization’s business situation to improve their current operational processes. Data mining is a process that assists data warehouse to dig and analyze big sets of data and extracting the data. It......

Words: 564 - Pages: 3

Free Essay

Data Warehouse

...LB5002:03 – DATA MANAGEMENT AND INFORMATION TECHNOLOGY CO5111:03 – BUSINESS INFORMATION SYSTEMS Assignment 1: Emergent Technologies The purpose of this assignment is to introduce and familiarise students with examples of emerging information technologies and concepts. Assignment 1 consists of two tasks. TASK 1 (5%) Task 1 is an individual task. In this task, each student is required to conduct research on emergent information technology topics. Students are then required to post at minimum of two topics they have researched on the LeanJCU discussion board. TASK 2 (25%) Task 2 is a group assignment with students working in groups of two or three. Each group will select an emergent technology topic from the posted list from task 1. Each group will then write a report and a presentation on the selected topic. The composition of each group will be confirmed and finalised on Session 2 (Townsville – Saturday, 29/10/07, Cairns – Sunday, 7/10/07). Required for task 2: Task 2 involves a group presentation and a written report on the selected emergent technology topic. Both the presentation and report should highlight today’s capabilities of the selected emergent technology by: • • • • • • Describing the technology, its features and functionality (not technical components); describing how organizations can potentially use the technology to improve business performance; describing how organizations are currently using it; identifying issues and risks managers that should know about;...

Words: 1489 - Pages: 6

Premium Essay

Data Warehouse

...innovative goals of the enterprise. The technologies and terms that comprise every major provider’s portfolio are starting to look and sound alike. New product offerings appear almost identical to existing products in the same market. The terms VPN, MPLS, convergence, the ubiquitous “IP,” service level agreements (SLA), single points of contact, managed network services, and global footprints are important in the telecommunications market, but we have heard them all before. The competitive differentiation that service providers desperately seek will not occur on this homogenous slate of technology and service offerings. Only when service providers truly understand what is happening from the customer’s perspective will real competitive differentiation take place. Providers must realize that they do not drive the networking and telecom environment; the customers’ strategic and tactical objectives drive it. If service providers wish to position at higher levels in the corporation, they must change the way they communicate. Such communication should not only show an understanding of the enterprise applications themselves but also an understanding of how the applications relate to the service providers’ product set. This paper will outline three (of the many) enterprise applications and business drivers service providers can use to differentiate themselves. We will examine the concepts of data warehousing and data mining for the purpose of effective enterprise resource......

Words: 5142 - Pages: 21

Free Essay

Data Warehouse

...It is common for facilities of an activity center like those in Larry’s Leisure Center to have peak time. When the number of people using a facility exceeds its capacity, there will be peak time. Too many customers using the facility at the same time can ruin activity experience, and the bad experience can lead to customer loss. Therefore, business intelligence can be used to explore the existence of peak time and to find out matching solutions to make customers remain. In this report, the swimming pool and exercise class are taken as examples which are assumed to have peak time. All the numbers in the report are not real and just given as examples to show how to use the data recorded. In the Pool table, each record means one entry into the swimming pool. Each record includes the exact date and time when the person uses the pool. It needs to count how many customers enter the swimming pool every three hours, and then divide the result by the capacity of the facility to calculate the ratio. When the ratio is far beyond one, it indicates this time period is the peak time and solutions are needed to solve the problem. The figure 1 shows the pool usage on a typical Friday and Saturday. Since the pool capacity is assumed to be 150 people, the 18:00 to 21:00 time period on Friday and 18:00 to 21:00 time period on Saturday are peak time. The ratio of the two time periods are 1.47 and 1.42 respectively, which are far beyond one and show too many people are sharing the crowded pool...

Words: 862 - Pages: 4

Premium Essay

Data Warehouse

...Make a screen capture showing the details in the Host Details tab and paste it in your Lab Report file. Repeat steps 12 and 13 for each host in the scan. In the Command box, highlight -O, type -sV and press Enter to run a software version scan. In the SYN scan from earlier in the lab, Zenmap identified the services running on the machines, but not the versions. This scan will discover the versions of the software on open TCP ports and will make a guess at the OS based on the services. As a result, unlike the fingerprint -O scan, the service -sV scan can provide a more detailed OS version. The scan was even able to detect the operating system on 172.30.0.7 as Linux, but this level of detail will take a little longer to run than the previous scans. Figure 9 Software version scan results Click 172.30.0.7 in the left pane and click the Ports/Hosts tab. The version for the services running on the TCP protocol are now visible in the Ports/Hosts tab. Figure 10 Software version results for each port Make a screen capture showing the details in the Ports/Hosts tab and paste it in your Lab Report file. Repeat steps 16 and 17 for each host in the scan. Click Scan > Save All Scans to Directory and navigate to the Security_Strategies folder (Local Disk (C:) > Security_Strategies), click the Create Folder button at the top right, type Scans and click Save. Figure 11 Save all scans Close the Zenmap......

Words: 1764 - Pages: 8

Premium Essay

Data Warehouse

...WHAT IS DATA WAREHOUSE AND WHY IS REI BUILDING ONE? A data warehouse can be described as a “database that stores current and historical data of potential interest to decision makers throughout a company. The data originate in many core operational transaction systems, such as systems for sales, customer accounts, and manufacturing, and may include data from Web site transactions. REI is building a data warehouse to improve the company and to meet the needs of the customers. The data warehouse will allow the company to view current and past data on sales, products, and customer information and also allow for the company to get to know the customers better and help in seeing which products are selling and become closer to the consumer and tailor goods to the needs of the consumer. WHAT ARE SOME OF THE DISADVANTAGES OF CONSUMER COOPERATIVES COMPARED TO TRADITIONAL FIRMS? Consumer cooperatives have some disadvantages in comparison to traditional firms. Consumer cooperatives require a high level of organization. Because the consumers are helping to make many decisions there are more legal responsibilities for the company. The company must listen to the consumers and also provide rules that the consumer cooperative must follow as a whole. While it is great for the consumers to be so involved, traditional firms have less of a hassle and don’t need to take so many extra steps when making decisions. DESCRIBE SOME OF THE MARKETING STRATEGIES THAT REI’S DATA WAREHOUSE WILL......

Words: 314 - Pages: 2

Premium Essay

Databases vs Data Warehouses

...DIFFERENCE BETWEEN DATABASE AND DATA WAREHOUSE 1 Database vs Data Warehouse Patricta Eric Doller Prudue University Relation Database Management Systems Bob Estein March 14, 2015 DIFFERENCES BETWEEN DATABASE AND DATA WAREHOUSE Relational database versus a data warehouse Businesses use new technology in many aspect of running everyday duties, like record keeping. To keep these records organized, companies have separate database and data warehouses. A database is used for a single application, mostly for transactions. These transactions can range from payroll, inventory to sales and any other transaction the company needs on a daily bases. A data warehouse is used for multiple domains running simultaneously. A company should use a data warehouse to show how they are doing, in whole, rather than just in certain areas. The warehouse can also track business trends. Companies do not usually do not put all their information into one database because of the possibility of being hacked into easily by a Hacker and used for the wrong intent. Although, it 2 would be cheaper to have just one database the security risks are too high. These problems would lead to dissatisfied customers, lack of business and lawsuits. So how is a data warehouse different from your regular database? After all, both of these are database, and they tend to function the same way. If you look deeper into them, you will find that they both have tables and they contain data. They both have indexes,......

Words: 1271 - Pages: 6

Premium Essay

Business Data Warehouse

...diagram, the following: Primary Data Warehouse and Data Mart. In this connection, explain the difference between ROLAP and MOLAP. A Primary Data Warehouse is a central repository of a database of a complete organization. It holds multiple subject areas and very detailed information. A Data Mart is a subset or an aggregation of the data stored to a primary data warehouse. It often holds only one subject area – for example, a specific department, finance or sales. It may hold more summaried data, and is typically smaller than a warehouse because of its employment on a different grain. Figure 1.1 illustrates the difference between data mart and a primary data warehouse. Since the data mart typically holds one subject area, it is much smaller than a primary data warehouse. These data marts can be viewed as small, local data warehouses replicating the part of primary data warehouse which is required by a specific domain or department. Data Warehouse Data Mart Data Warehouse Data Mart Figure 1.1 A data warehouse does not necessarily use a dimensional model, since it is partly normalized RDBMS, but data marts are multidimensional cubes. This connection gives arise to two concepts, ROLAP and MOLAP. ROLAP is an implementation based on a relational database, in our case which is a primary data warehouse, and MOLAP is an implementation based on a multidimensional database which are data marts. ROLAP tools use the relational database to access the data and generate SQL queries......

Words: 685 - Pages: 3

Premium Essay

Data Warehouse

...What is a Data Warehouse • A data warehouse is a relational database that is designed for query and analysis. • It usually contains historical data derived from transaction data, but it can include data from other sources. Finance, Marketing, • Data warehouse can be:  Subject Oriented  Integrated  Nonvolatile  Time Variant Inventory SAP, Weblogs, Legacy Identical reports produce same data for different period. daily/monthly/quarterly basis Why Data Warehouse • • • • Provide a consistent information of various cross functional activity. Historical Data. Access, Analyze and Report Information. Augment the Business Processes Why is BI so Important Information Maturity Model Return on Information BI Solution for Everyone BI Framework Business Layer Business goals are met and business value is realized Administration & Operation Layer Business Intelligence and Data Warehousing programs are sustainable Implementation Layer Useful, reliable, and relevant data is used to deliver meaningful, actionable information BI Framework Business Requirements Data Sources Data Sources Data Acquisition, Cleansing,& Integration Data Acquisition, Cleansing, & Integration Data Stores Data Stores Information Services Information Delivery Information Delivery Business Analytics Business Analytics Business Applications Business Applications Business Value Business Value Development Data Resource......

Words: 3637 - Pages: 15

Premium Essay

Mining the Data Warehouse

...Mining the Data Warehouse Summary In “Mining the Data Warehouse”, It speaks of a survey done by Merrill Lynch back in 2006. It tells us that “business intelligence software and data-mining tools were at the top of CIOs’ technology spending list” (Baltzan, Hag, Phillips 87). It gives a few examples of how companies are using the software and tools to gain very valuable information. When Ben & Jerry’s is mentioned, people know the brand and immediately think of ice cream. “Ben & Jerry’s cuts through the din by using integrated query, reporting, and online analytical processing technology from BI software vendor Business Objectives” (Baltzan, Hag, Phillips 87). They use the technology to track each pint’s ingredients throughout its life. If there is a complaint made by a customer, they will track it back through ingredients, suppliers, or whatever caused the issue. They are extremely focused on quality of their products. “The BI tools let Ben & Jerry’s officials access, analyze, and act on customer information collected by the sales, finance, purchasing, and quality-assurance departments” (Baltzan, Hag, Phillis 87). They have gotten it down to a science. They can tell you what milk a customer prefers for the ice cream. In 2005, they tracked over 12,500 customer’s information and comments. The California Pizza Kitchen has 130 casual dining full-service restaurants throughout the many states and other countries. They are known for their premium pizza. ......

Words: 1667 - Pages: 7

Premium Essay

Data Warehouse

...Proposal – Data Warehouse Inventory What is data warehouse? It’s a simple assembling of data gathered from various sources and available to the global customers in a way they can understand to their business level. Statement of Problem: Our intention is to develop inventory application (User interface) so that client can easily track their warehouse. For creating interface we are using ETL and IBM Cognos. Methods to solve problem: * Creating Database for customer * Extract transform and loading the data * Data Mapping * Creating Package from Framework Manager using IBM Cognos * Developing Interface using IBM Cognos Report Studio Tools: IBM DB2, Informatica Power center, IBM Cognos 10.2 Resources: 3 Client: Volvo Trucks. Software Development Life Cycle 1. Requirements 2. Design 3. Development 4. Testing 5. User Signoff Requirements for data warehouse design: A. Goals: 1. To design the flexible data warehouse as per the client requirements. 2. Get the raw data into the centralized repository so that the users can access it in a flexible manner. 3. We should make sure the data cannot be edited by the users. B. Clients Requirements: 1. We need to get all the details of the client products available in their inventory. 2. We need to study the client environment and take down the end to end process. C. Business Questions: 1. Need to discuss about the design duration. 2. Number of resource required for the...

Words: 316 - Pages: 2

Premium Essay

Modeling Data Warehouse

...Question: 1. When developing a successful data warehouse, what are the most important risks and issues to consider and potentially avoid? Data warehouse projects have many risks. Most of them are also found in other IT projects, but data warehousing risks are more serious because data warehouses are expensive, time-and-resource demanding, large-scale projects. Each risk should be assessed at the inception of the project. When developing a successful data warehouse, it is important to carefully consider various risks and avoid the following issues: • Starting with the wrong sponsorship chain. You need an executive sponsor who has influence over the necessary resources to support and invest in the data warehouse. You also need an executive project driver, someone who has earned the respect of other executives, has a healthy skepticism about technology, and is decisive but flexible. You also need an IS/IT manager to head up the project. • Setting expectations that you cannot meet. You do not want to frustrate executives at the moment of truth. Every data warehousing project has two phases: Phase 1 is the selling phase, in which you internally market the project by selling the benefits to those who have access to needed resources. Phase 2 is the struggle to meet the expectations described in Phase 1. For a mere $1 to $7 million, hopefully, you can deliver. • Engaging in politically naive behavior. Do not simply state that a data warehouse will help managers make better......

Words: 2805 - Pages: 12