2. Live with vague expectations
If, for example, you currently mail a giant catalog to all customers quarterly, and you want instead to do 100 targeted mailings of specialty catalogs each going to about 2% of your customers, then hold two facilitated discussions with stakeholders. First, talk about what the new mailing process will be and how it will get carried out 25 times as often each year. Second, explore the information capabilities needed to support the process. Then work out data warehouse usage scenarios and develop the necessary workloads, service levels, and other requirements. Don't skip identifying requirements, or you'll end up back at pitfall No. 2.
4. Skip risk analysis
5. Accept flimsy "proofs"
6. Underestimate growth rates Knowing this year's requirements isn't enough.
7. Ignore any dimension of scalability
Illustration by Sek Leung
There's always a temptation to wait too long before doing performance and scalability testing. The classic trap is waiting until the system is ready to go into production. Sure, the test is realistic because you can run the actual database and application, but if you discover something wrong, it's often too late to do anything about it. Test for scalability before you're committed.
Database people think that if no requirements are established, no one can prove they failed. In reality, it's often worse in this situation; management assumes that the system will meet all expectations, so the system is never good enough. You're much better off setting realistic expectations that can be met.More Business Intelligence Insights
White Papers
Webcasts
Reports
Videos
Users often don't know what the requirements are. You have to help them visualize a new business process and the requirements for supporting it. Only then can you develop valid usage scenarios and engineering requirements.
Once you develop requirements, identify, test, and manage the risks that emerge.
Beware of salespeople taking over the definition of the proof. Never let the vendor define the test to be performed. If you don't have the expertise in-house, get a consultant with experience defining benchmark specifications for testing complex data management systems. Your test has to capture the key challenges of scale and performance.
Architectural and platform decisions will take awhile to implement and longer to change. Project requirements out two to three years, at a minimum--better to have a projection that gets revised than shoot in the dark. And don't assume that the data growth rate is the same as the business growth rate. Data and workloads tend to grow faster than the related businesses because data gets used more intensively as the business gains momentum.
Data size is the dimension easiest to measure, but the workload, data complexity, query complexity, availability, and data latency dimensions are nearly as important. They can all drive configuration size and determine whether you're on the right platform. Take them all into account.
Scaling The Data Warehouse
Microsoft And Oracle Are Scaling Out
and
EBay Turns To Analytics As A Service
BP seeking Regional Desktop Coordinator in Houston, TX
Lowes seeking DC Systems Technician I in Lebanon, OR
INVIA Medical Imaging Solutions seeking Software Engineer in Ann Arbor, MI
Citrus Community College seeking Programmer Analyst II in Glendora, CA
City of Westland seeking MIS Director in Westland, MI
For more great jobs, career-related news, features and services, please visit our Career Center.
Green Computing Special Reports
The "reduce/reuse/recycle/save the planet and some cash" movement isn't only about server virtualization, or even just the data center, anymore. In this series...
read more 
NOTE: Offer valid for U.S., U.S. possessions, & Canada only