1/
This paper has been reproduced as submitted.
It has been issued without formal editing.
Introduction
Census is the main source of
population data in the country. Its chief merit
lies in giving socio-economic and demographic
data even at micro level like village or enumeration
block in a town. The office of Registrar General,
India (ORGI) under the Ministry of Home Affairs
, Govt. of India is an independent authority for
conduct of population census and related surveys.
The population statistics so generated shall also
form part of the National statistical database
which mutually support one another as an integral
part of the National statistical system. The recently
conducted decennial Census of India in Feb-March
2001 was the 14th in the series since 1872 and
the 6th since independence. According to Census
of India 2001, the population of India at 0.00
hrs of 1st march 2001 stood at 1027 million. As
expected it has crossed the one billion mark.
It is certainly most unlikely that in the history
of mankind, any country other than India and China
would be shaping the lives and fortune of over
a billion people. This paper gives an idea about
data collection methodology adopted and
application of information technology for
processing and dissemination of population
data in Indian context.
Data
collection
The census Operations in India
are carried out in two phases - the House numbering
and the Houselisting operations followed by the
Population Enumeration. The objective of the House
Numbering and the Houselisting operations is to
systematically list out all structures, houses
and households throughout the country for preparation
of a sound frame for conduct of the population
enumeration as well as to generate a comprehensive
database on the housing stock across the country,
its quality and the living conditions of the occupants.
The Houselisting operations was conducted during
April-September, 2000. The Population Enumeration
was undertaken through door-to-door enumeration
by specially appointed and trained enumerators,
between 9th to 28th February, 2001(both days inclusive)
with a revisional round from 1st to 5th March
2001.
Data
preparation (Data Entry)
The bottleneck area of census
data processing in India is really the conversion
of document-based data into computer readable
format. Newer technologies for data entry were
introduced in every round of census in India taking
advantage of the advancement in the field of Information
Technology. During 1981 census, key to disk
equipments were used for data entry followed by
introduction of on-line data entry systems
(PC-386 based) in 1991 census. Keeping in
pace with technological changes, this time, Census
office has decided to implement Image based
automatic form processing technique for electronic
data file preparation combined with image enabled
Computer Assisted Coding (CAC) of 2001 Census
data. Pilot studies were conducted using different
form processing software and the results were
really encouraging. Briefly, the technique involve
i) capturing of batches of filled in schedules
as electronic digital images using high volume
production Scanners ii) processing of the handwritten
numeric characters in the digital image with the
help of Intelligent Character Recognition (ICR)
engines embedded in the form processing software
iii) manual correction of unrecognized characters
in the above processing stage by using an interfacing
Optical Data Entry (ODE) screen package iv) image
enabled computer assisted coding and v) retrieval
and exporting of data in ASCII file format
for further computer processing. Switch based
Local Area Network (LAN) with powerful micro-computers
is being established in about 15 locations to
decentralize the entire processing activity. It
is hoped that introduction of this new technology
for the first time in Census of India may enable
speedier preparation, analysis and dissemination
of data.
Data
processing
From the traditional manual
method of editing, coding and tabulation, Indian
Census has come a long way to electronic data
processing (EDP). Data processing is undertaken
centrally at Headquarters office at New Delhi.
Absence of an in-house computer system in the
Census office was obviated during 1991 Census
by installing a mainframe computer. Owing to the
huge processing workload a judicious mix of manual
and computer tabulations were used at that time.
Over the years, with mainframe computers giving
way to affordable and manageable microcomputers
in a Local Area Network (LAN), the data processing
of 2001 Census will now be focused on to a group
of networked powerful microcomputers.
With the introduction of imaging technology, it
will be now possible to fully computerize processing
of 2001 census data. A large number of reports
on various subjects at different geographical
level are planned to be generated using IMPS package
developed by International Statistical Program
Center, USA .
Data
dissemination
Earlier, dissemination of data
was confined to bringing out publications as well
as through magnetic tapes. Keeping pace with the
information technological development and to meet
increased demand by data users , since 1991 census
a "data users service center" was established
at headquarters office, New Delhi and data
disseminated in a standard format through diskettes,
CD-ROM. Limited number of unit level data is also
being disseminated on special demand by users.
In the coming years, data dissemination activity
would be more strengthened and decentralized aiming
towards catering to abundant demand of users.
The availability of affordable,
powerful microcomputers, user-friendly data analysis
software packages and electronic highway has a
dramatic impact on data dissemination front of
census organization. Moreover, information
is now carried on around the globe with speed
of light and becoming available to anyone at any
place. In the light of these developments, Census
office in India has already hosted a website (www.censusindia.net)
giving important characteristics of Indian population,
latest list of release of publications etc. The
provisional population totals for Census of India
are already available on the website. Although
a beginning is made in this direction , ORGI is
contemplating to set up network of computers in
states across the country to disseminate data
nationally and internationally under the Internet
umbrella.