UN Web Site | UN Web Site Locator
Home Site map Contact 
ESCAP Statistics Division
ESCAP Statistics Division
 
Third Meeting    
The Third Meeting of the Working Party on the Application of New Technology to Population Data
Bali, 7-9 January 1999

STAT/WPA(3)/19
7 January 1999
ENGLISH ONLY

ECONOMIC AND SOCIAL COMMISSION FOR ASIA AND THE PACIFIC

Working Party on the Application of New Technology to Population Data
Third meeting
7-9 January 1999
Bali

Survey Processing Template (SProceT)
David Archer and Ray Freeman
Statistics New Zealand
Contents

Background

The need for a generalised survey processing system has been apparent in Statistics New Zealand for some time. Key requirements were the ability to rotate processing staff between surveys, improving publication times, improving quality without adding to costs, reducing the risks of errors, as well as reducing development costs. However, the development of a generic system, wasn't considered achievable, given the inherent differences between surveys and the specialised nature of software. Our solution was the development of a "standard" survey processing system template, that could easily be reused. Lotus Notes lends itself well to this approach, in that common functionality can be added simply using "cut and paste".

The standard template is to be used for several surveys over the next 18 months, including, QMS (Quarterly Manufacturing Survey), QES (Quarterly Employment Survey), ERBS (Employment Related Business Statistics), RT. (Monthly Retail Trade) and AES (Annual Enterprise Survey). Other simpler production surveys will also use the template. This will eventually become the standard way to process all business surveys.

The template was designed for business surveys but could also be used for surveys of individuals. The feasibility of using the template for interview surveys has not been tested, but the capture from paper questionnaires or electronic data and all subsequent survey phases is not fundamentally different to business surveys.

Progress to date

The quarterly Accommodation Occupancy survey has been using an early version for two years. The QMS (Quarterly Manufacturing Economic Survey) using the production version of Sprocet has completed its second quarter in production, while the QES (Quarterly Employment Survey) version is being used for the first time for the November 1998 survey.

Our expectations for the system which seemed ambitious two years ago have been exceeded and further additional benefits and development opportunities have now become evident.

Key Strategic advances

The main advantage of Sprocet is integration: being written in Lotus Notes has facilitated fully integrated desk to processing. Lotus Notes is the SNZ gateway to discussion databases and email as well as survey processing using Sprocet.

Sprocet has utilised the power of Lotus Notes views to integrate all survey processing phases from data capture through to output editing. At all stages of the process management information can be produced which gives the staff involved and any other approved user in the organisation the ability to balance work flows and monitor progress against contractual agreements. The dynamic nature of views along with the use of categories, sorting and drill down, have integrated and transformed processes that were previously large batch jobs, into interactive processing.

Sprocet is also very integrated with the Business Frame and the Business Surveys database of respondent information. Both of these written in 'SQL Windows' and located on Sybase databases. The minimum of information is replicated onto Sprocet, the rest is retrieved dynamically during processing and changes identified during the survey process are updated back to the Sybase databases.

The functions of system described below uses the quarterly manufacturing survey (QMS) as the example. The QMS is a sample panel survey of 1,600 manufacturers. The survey collects 19 data items on sales, stocks, expenditures, profits, hours worked. Small manufacturers complete a short form with four data items. Results are published for 10 industry groups and are available 10 weeks after the end of the quarter. The survey is a key component in producing GDP estimates. The survey design allows for births each quarter form the business frame and imputation for non and partial response.

Main Functions

Sprocet itself is organised to match the workflows of a standard survey. The system is organised into 5 key areas, These are:

Management activities, 
Unit respondent functions,
Data capture functions,
Macro Checking functions
and finally the Survey Results area. 

Movement around the system is controlled via a Navigation menu on the left hand side of the display. Individual details are then displayed on the right hand side of the display.
The Navigator Display

This screen provides access to each of the five over arching functions and also access to the Business Frame, and also access to standard Notes applications.

The Navigator Display
The Management Area

This groups together the management functions of the survey. These tend to be the large processing agents, such as the creation of postouts, reminder runs, non-response unit record imputation and sample maintenance. In addition the system also records the run status of these processes, allowing a history of processing information to be built-up over time.

The Management functions Area

The following processes (were created in Lotus Notes) are included amongst those carried out by the agent scheduler.

1. Imputation

The QMS survey uses two methods for imputing for cases where non-response exists. These are:

Historical imputation - the values from the previous quarter are used, after adjusting by an estimate of the "trend" calculated from respondents.

Weighted mean imputation - a mean is calculated from respondents, and used as an estimate for non-respondents.

Each KAU is assigned to an imputation cell. Imputation is carried out within each imputation cell. If there are not enough respondents in an imputation cell, the system combines cells.

2. Estimation

Rate-up estimation is used to produce population estimates in the non-full coverage strata of the various industry groups. Rate-up estimation is where each KAU's value is multiplied by its weight and these are summed up to produce a population estimate. Each KAU in the sample is assigned a weight. The selection cell weight is formed by considering the number of KAUs in the survey population for the particular selection cell, dividing by the actual number of KAUs selected within that selection cell. The selection cell weights are updated quarterly by the addition of births in the survey. Any non-full coverage KAUs which may distort the estimates if we use their selection cell weight are specially treated. Special treatment is the process of reducing a KAU's selection cell weight to 1.

3.Grossing-Up Procedure for Small Firms

This is a special imputation procedure which is applied to all QMS respondents receiving short form questionnaires. The variables not collected from respondents for these KAUs are imputed by multiplying their values for quarterlysalaries and wages for employees by ratios created using past Census data.

The Units (Respondents) Area

This area allows quick access to the units contained within the system. Access can be via specialised views, grouping together respondents with similar characteristics such key firm status, recent births, all live units, all units etc. By selecting the respondent in the view, their basic survey information is displayed, e.g. name and address information, sample weights, survey processing flags, postout and reminder dates, classification information, size details, as well as any comments recorded by the processing sections.

The Units (Respondents) Area
Notes
  1. A "KAU" is the Business Frame reference number given to units that can supply full profit and loss accounting information. A "KAU" is usually equivalent to an Enterprise, but some large enterprises have more than one "KAU".
  2. The names of individual respondents have been deleted to meet confidentiality requirements.
The Answers Area

This area groups together the data capture and edit functions. The initial view records the current rate of progress in processing the incoming questionnaires. Individual data capture of respondents is started from this view. In addition on-line editing rules are created and maintained by the client sections directly into the system.

The Answers Area
The Macro Checks

This area groups together the macro checking of the data captured. It includes those checks done across all the units of the survey. The checks can be carried out at any stage of the survey processing cycle as results are progressively updated as the data is captured. Such checks allow early and consistent checking of the survey trends and likely results. Unusual or unexpected results can be discovered earlier on in the processing cycle.

The Macro Checks
The Results Area

The last area provides quick access to the results of the survey. A variety of results are valuable including weighted results by key classifications, as well as the ability to drill down on the results set e.g. from national level to industry level , to sample strata, to individual respondent.

As much of the system is dynamic a change in one area is reflected in other areas. e.g. an amendment to the data from one respondent is then reflected dynamically in the results views. Likewise a re-run of the non-response imputation system would also be reflected in both the macro checks and results views. 

The Results Area
The Benefits of Sprocet

The integrated nature of Sprocet has the benefits of reducing survey costs, while improving timeliness and survey quality. The concept of reusing a template, and then customising it, gives a standardised client interface across surveys, as well as, reduced development costs.

There are also significant business benefits with using Sprocet. Feedback from the users has been positive with our major clients also re-organising their operations to take advantage of the Sprocet features. The significant feature impacting on clients are:

  • The system contains all the process for the survey within the one system. All the input process, output processes and the associated run time logs are held within Sprocet. There is no movement of data or change of platforms within the processing cycle. The results of all processing runs are available, back over time, for each processing cycle.
  • Easy access to all the processes, regardless of work site or section. Both the input processing staff and the output quality analysis staff are using the same system and viewing data in a similar fashion. For the QMS there are four groups of staff using the system. The respondent liaison section is in our Auckland office, data input, editing is carried out by another group of staff in Auckland. The survey methods staff who have an audit role are in Wellington and the Output analysis and publication is carried out in Christchurch. Each group works on the same database and can view processes carried out by all groups as they occur.
  • The use of Lotus Notes is already universal within Statistics New Zealand, therefore having the survey processing system also contained within Lotus Notes significantly improves the basic familiarity of staff with the system and reduces training time required to a minimum.
  • With the system containing both micro and macro data checks, there has been a significant move away from the checking of micro level data and a significant increase in the amount of macro level data checks.
  • The macro editing checks follow a 'top - down' approach, allowing data to be checked at the survey level, then the industry level and then a drill down feature to view the individual respondents of interest.
  • Finally the system is dynamic in nature. This means that clients have the ability to look at survey level results, drill down to an individual units, alter the data and then have the results now reflect the new data values.
Future developments
Future technical directions are likely to focus on electronic interfaces, with:
  • Electronic means for respondents to supply data, e.g. the supply of input data by fax, the web, email, respondent direct data dumps or via the use of touch tone data entry etc.
  • New options for sending reminders, using email, efax as well as the traditional reminder letter.
  • On-line questionnaire image storage & retrieval;
  • Tax data access: the use of administrative data to impute non response and edit survey data from questionnaires;
  • Direct updating of results into Time Series Manager (TSM).
Contacts for more information
David Archer
Manager Business Development
Internet: david_archer@stats.govt.nz
Ray Freeman
Project Leader Sprocet Development team.
Internet: ray_freeman@stats.govt.nz
Prepared for the ESCAP working group on new technologies for population surveys.
Bali Indonesia
January 1999

 
Pop-IT project (1997-2001)
Project Objectives
Working Party Members
Working Party Meetings
First meeting, Bangkok, 24-26 September 1997
Second meeting, Singapore, 1-3 April 1998
Third meeting, Bali, 7-9 January 1999
Fourth meeting, Manila, 6-9 July 1999
Ffth meeting, Bangkok, 21 October 1999
Sixth meeting, Bangkok, 26 March 2001
Workshops
Application of New Information Technology to Population data, Bangkok, 12-20 October 1999
Population Data Analysis, Storage and Dissemination Technologies, Bangkok, 27-30 March 2001
Guidelines
Population data collection and capture (BBS - Statistics Indonesia)
GPS in modern mapping and GIS technologies to population data (Bangladesh Bureau of Statistics)
Population data dissemination (Statistics New Zealand)
Project Newsletter
Contact us
   
Copyright (c) 2013 ESCAP  |  Legal Notice