Sampling, the process and procedures for obtaining foods that are representative of those available and consumed, is fundamental to any food composition activity. Preparation of a sampling plan often requires involvement of all the major contributors to a food composition program. Data generators must be involved in the sample collection, or at least the sample handling, so that samples may be immediately and properly prepared for analysis.

Data compilers must be involved because information on the sampling plan and details such as when and where sampling took place are parts of a food composition database's metadata. Data users must be involved because they have the knowledge of the foods that need to be analyzed, and often the location from which the samples should be collected.

The services of a statistician are useful for developing a sampling plan, because representativeness is dictated by the number of food units collected (and analyzed) to achieve the goal.

The goal might be to compare compositional differences between cultivars, or achieve nationwide mean values for a food composition database. The overall quality of food composition data is determined largely by the sampling plan. The collected samples must be properly handled so that they arrive at the laboratory without changes that might affect their composition.
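The question of how many food units to collect can be illustrated with a textbook sample-size calculation. The sketch below is only one simplified possibility, not the method prescribed by any sampling plan: it assumes simple random sampling, an approximately normal distribution, and a prior estimate of the standard deviation. The function name and the numbers are hypothetical.

```python
import math
from statistics import NormalDist


def units_needed(sd, margin, confidence=0.95):
    """Approximate number of food units needed so that the mean is
    estimated within +/- `margin` at the given confidence level.

    Assumes simple random sampling and a known (prior) standard
    deviation `sd`, in the same units as `margin`.
    """
    if sd <= 0 or margin <= 0:
        raise ValueError("sd and margin must be positive")
    # Two-sided critical value of the standard normal distribution.
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    return math.ceil((z * sd / margin) ** 2)


# Hypothetical figures: SD of 10 mg/100 g, desired margin of 5 mg/100 g.
print(units_needed(sd=10, margin=5))
# Halving the margin roughly quadruples the number of units.
print(units_needed(sd=10, margin=2.5))
```

A comparison between cultivars or a nationwide mean would normally call for a stratified or otherwise more elaborate design, which is where a statistician's input matters most.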

The key component, crucial to the correct determination of all other food components and most easily affected by improper handling and storage, is water (moisture). Once samples are collected and documented, they are prepared for analysis. After this type of preparation, samples will be stored, or immediately analyzed. As with sample collection and sample handling, proper documentation of all aspects of sample preparation is essential.
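The central role of water can be seen in a simple conversion between a dry-matter basis and a fresh-weight basis. The following is a minimal sketch with hypothetical values; the function name is an assumption made for illustration.

```python
def dry_to_fresh(value_per_100g_dry, moisture_g_per_100g):
    """Convert a component value expressed per 100 g dry matter
    to a value per 100 g fresh (as-collected) weight."""
    if not 0 <= moisture_g_per_100g < 100:
        raise ValueError("moisture must be in the range [0, 100)")
    dry_matter_fraction = (100 - moisture_g_per_100g) / 100
    return value_per_100g_dry * dry_matter_fraction


# Hypothetical sample: 50 mg of a component per 100 g dry matter.
as_collected = dry_to_fresh(50, moisture_g_per_100g=80)   # 80 g water/100 g
after_drying = dry_to_fresh(50, moisture_g_per_100g=75)   # water lost in storage
print(as_collected, after_drying)
```

In this hypothetical case, a loss of five percentage points of moisture during handling raises the apparent fresh-weight value of every other component by a quarter, even though nothing else in the sample has changed.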

Most laboratories undertake a limited range of analyses for food composition purposes. These include a set of core components and then additional components of interest, for example, those relevant to laboratory research dealing with diet-related health problems. Core nutrients usually include the complete range of proximate components (water, nitrogen for the protein calculation, fat, glycemic carbohydrate, dietary fiber, ash, and where relevant, an energy value using factors applied to the energy-yielding proximates), some vitamins, and some nutrient elements.
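The two calculations mentioned among the proximates, protein from nitrogen and energy from factors, can be sketched as follows. The text does not say which factors a given program uses; this example assumes the widely used general nitrogen-to-protein factor of 6.25 and the general Atwater factors of 4, 9, and 4 kcal/g for protein, fat, and carbohydrate. Food-specific factors, and contributions from dietary fiber or alcohol, are deliberately left out. Function names and sample values are hypothetical.

```python
# Assumed general factors; a real program may use food-specific ones.
N_TO_PROTEIN = 6.25
KCAL_PER_G = {"protein": 4.0, "fat": 9.0, "carbohydrate": 4.0}


def protein_from_nitrogen(nitrogen_g, factor=N_TO_PROTEIN):
    """Protein (g/100 g) calculated from total nitrogen (g/100 g)."""
    return nitrogen_g * factor


def energy_kcal(protein_g, fat_g, carbohydrate_g):
    """Energy (kcal/100 g) from the energy-yielding proximates."""
    return (
        protein_g * KCAL_PER_G["protein"]
        + fat_g * KCAL_PER_G["fat"]
        + carbohydrate_g * KCAL_PER_G["carbohydrate"]
    )


# Hypothetical analytical results, all per 100 g of food.
protein = protein_from_nitrogen(2.0)
print(protein)
print(energy_kcal(protein, fat_g=5.0, carbohydrate_g=20.0))
```

Because the energy value is derived rather than measured, any error in the nitrogen, fat, or carbohydrate determinations carries through to it.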

Additional components of interest often include cholesterol, individual fatty acids and aggregations of fatty acids (for example, total saturated fatty acids), carotenoids (both provitamin A carotenoids and antioxidant carotenoids with no provitamin A activity), other bioactive nonnutrients, heavy metals, and some so-called antinutrients (for example, phytates).

Proper laboratory practices must be strictly adhered to, as well as laboratory quality assurance and quality control procedures, and details of analytical methodologies must be properly documented. Data compilation requires a relational database management system, and adherence to international food composition standards where they exist.

The database should accommodate numeric data, text, and graphics. Ideally, all the raw analytical data, and their attendant documentation, should be captured. The system should then be able to manipulate these data in many different ways.
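One way to picture such a system is a small relational schema in which every raw analytical value is stored with its sample and documentation, and summary values are derived on demand. The sketch below uses Python's built-in sqlite3 module; the table names, column names, and data are hypothetical and do not represent any INFOODS or national standard.

```python
import sqlite3

SCHEMA = """
CREATE TABLE food (
    food_id INTEGER PRIMARY KEY,
    name    TEXT NOT NULL,
    photo   BLOB                 -- optional graphic
);
CREATE TABLE component (
    component_id INTEGER PRIMARY KEY,
    name TEXT NOT NULL,
    unit TEXT NOT NULL
);
CREATE TABLE sample (
    sample_id    INTEGER PRIMARY KEY,
    food_id      INTEGER NOT NULL REFERENCES food(food_id),
    collected_on TEXT,           -- sampling metadata: when
    location     TEXT            -- sampling metadata: where
);
CREATE TABLE analytical_value (
    sample_id    INTEGER NOT NULL REFERENCES sample(sample_id),
    component_id INTEGER NOT NULL REFERENCES component(component_id),
    value        REAL NOT NULL,
    method       TEXT,           -- analytical method documentation
    PRIMARY KEY (sample_id, component_id)
);
"""


def build_demo_db():
    """Create an in-memory database with a few hypothetical rows."""
    con = sqlite3.connect(":memory:")
    con.executescript(SCHEMA)
    con.execute("INSERT INTO food (food_id, name) VALUES (1, 'example food')")
    con.executemany(
        "INSERT INTO component VALUES (?, ?, ?)",
        [(1, "water", "g/100 g"), (2, "fat", "g/100 g")],
    )
    con.executemany(
        "INSERT INTO sample VALUES (?, ?, ?, ?)",
        [(1, 1, "2000-01-01", "site A"), (2, 1, "2000-01-02", "site B")],
    )
    con.executemany(
        "INSERT INTO analytical_value VALUES (?, ?, ?, ?)",
        [
            (1, 1, 80.0, "hypothetical method"),
            (2, 1, 82.0, "hypothetical method"),
            (1, 2, 3.0, "hypothetical method"),  # fat measured on one sample only
        ],
    )
    return con


def mean_values(con):
    """Derive (food, component, n, mean) rows from the raw values."""
    return con.execute(
        """
        SELECT f.name, c.name, COUNT(*), AVG(v.value)
        FROM analytical_value AS v
        JOIN sample    AS s ON s.sample_id = v.sample_id
        JOIN food      AS f ON f.food_id = s.food_id
        JOIN component AS c ON c.component_id = v.component_id
        GROUP BY f.food_id, c.component_id
        ORDER BY c.component_id
        """
    ).fetchall()


print(mean_values(build_demo_db()))
```

Storing one row per sample and component means that a component measured on only some samples simply has fewer rows, so sparse data need no special handling.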

The same data system should provide an exhaustive reference database and any number of abridged user databases to satisfy the broad range of user requirements for food composition data. Many compilers only capture mean values, a practice that will satisfy many users. Other compilers provide more information, and therefore higher-quality databases, by including the number of samples and some expression of their variability.

Other compilers are able to capture all the analytical data and prepare user databases with ranges (that is, high and low values), medians, and many different statistical expressions of the data, satisfying a broader range of users and ensuring the highest quality database.
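The three levels of compilation described above differ only in how much is derived from the same raw values. As a minimal sketch, using Python's standard statistics module and hypothetical data, a compiler holding all analytical values could produce each kind of summary like this:

```python
from statistics import mean, median, stdev


def summarize(values):
    """Summary statistics for one component in one food,
    computed from all individual analytical values."""
    n = len(values)
    if n == 0:
        raise ValueError("at least one analytical value is required")
    return {
        "n": n,                                   # number of samples
        "mean": mean(values),
        "median": median(values),
        "sd": stdev(values) if n > 1 else None,   # sample standard deviation
        "low": min(values),                       # range: lowest value
        "high": max(values),                      # range: highest value
    }


# Hypothetical values for one component, in mg/100 g.
print(summarize([1.0, 2.0, 3.0, 6.0]))
```

A database that stores only the mean cannot later recover the range or the median, which is why capturing the raw data supports the widest range of users.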

In data compilation, all food composition data can be included in the database. Complete information for all components in all foods is not necessary. Ideally, a database with one thousand foods should have complete information for core nutrients, but should also be able to accommodate sporadic data for other components in the foods included. The early work of INFOODS included the development of standards and guidelines for compiling food composition databases for national and regional use (Rand et al.

These standards are being maintained and further developed by INFOODS expert committees and working groups. With appropriate data compilation, food composition data can be disseminated in many different forms to satisfy all user requirements.

Table 1 shows examples of some of the common forms in which food composition data are disseminated. Data disseminated as a set of relational files offer users with very specific needs, or those with customized software, the opportunity to use the data as they wish.

Other common dissemination formats provide the types of information most often required by users. Different countries have different approaches for charging, or not charging, for their data and data products. The United States Department of Agriculture prepares the largest single body of food composition data in the world and disseminates it freely on the World Wide Web, as both a downloadable set of relational files and a searchable reference volume.



