When you define your DB2 ReportCenter databases, you specify how much space to allocate for each separate database table. Space is physically allocated in DASD units, but database size more usefully refers to the logical number of table rows.
ReportCenter uses a very simple data warehousing star schema. Star refers to data model designs where multiple dimension tables (in ReportCenter's case, Resource, Attribute, and Period) revolve around large central fact tables (NumericFact and EnumeratedFact).
Details of each different resource, report attribute, and time period are stored in the database once only-in a row in their dimension table. When a data observation arrives for a specific resource, attribute, and period, the value is stored in a row in one of the fact tables, with pointers to the associated resource, attribute, and period dimension table rows.
It is the fact tables, NumericFact and EnumeratedFact, that vary most in size between ReportCenter implementations.
Typical NumericFact table sizes may range from less than one million to tens of millions of rows.
EnumeratedFact table sizes are generally 20% - 30% of the number of rows of the NumericFact table, although this varies greatly depending on the specific data collected. For the same interval, EnumeratedFact data takes more physical space than NumericFact data.
ReportCenter dimension and internal-use tables are reasonably constant in size between implementations.
After you have defined most of your essential data collection, ReportCenter data regions send data to the database at a more-or-less constant rate. However, the database does not grow at a constant rate, for the following reasons:
You should run the DB2 reporting utilities regularly to monitor the growth of each table.
Example database space allocations for the ReportCenter database are specified in the distributed dsnpref.NMC1.CC2DSAMP(WRDB2CRE) sample JCL member. Review the comments in this member carefully.
Member WRDB2CRE contains an example of row, cylinder, and block requirements for a database that should be adequate for most organizations. For all tables, a secondary extent is specified with 10% of the primary value.
If you have a small or medium scale CA NetMaster implementation, we recommend that you use the default primary space allocations. However, you should review and, if possible, remove the secondary allocations on the NumericFact and EnumeratedFact tables.
DB2 tables are physically stored as VSAM files. A large number of VSAM extents can badly impact SQL performance and CPU consumption required for Input/Output. Review your organization standards and, if possible, specify no secondary extents on at least the fact tables.
You may get errors sooner, when or if the primary extent fills, but you will avoid situations such as enormous tables spread over 120+ extents and multiple devices. This applies equally to index space allocations.
You only need to estimate the number of rows in the NumericFact table. The EnumeratedFact table generally has one quarter to one third of the number of NumericFact rows. Most report data is numeric.
The number of rows for the remaining six tables does not vary as widely and the sample allocations should be adequate.
The number of NumericFact rows depends on how much data you collect and the length of time you keep it. In practice, it is not easy or necessary to predict exactly when a table will reach a given size. You should aim to have a primary extent large enough to hold all the NumericFact rows you are likely to need. Regular monitoring gives an indication of the growth of the fact table.
Example: Estimate Numeric Fact Rows
Use the following as a very rough guideline. If you have data feeding in from a region that is sending all possible report data from a single stack and a small number of each other resource type, you need approximately 500,000 rows after six months, after which the rate of increase should slow down. This is an indication only-individual regions can monitor more than this. This example uses the default retention periods.
To minimize the space required, send to ReportCenter only the data on which you want to report. Try to prevent your database filling up with data you do not want and do not know is there.
The following are ways that you can minimize the space required:
Data is added to and deleted from your ReportCenter live database every day. Regular database reorganizations are essential to reuse and reclaim space.
| Copyright © 2012 CA. All rights reserved. |
|