Tibetan Buddhism at the Digital Frontier
Robert R. Chilton, Technical Director
The
Asian Classics Input Project (ACIP), USA
ˇ§The 1990s saw the emergence of a new and powerful, global communications technology, which has the potential to be more significant to the development of education, culture and health than the emergence of radio, film or television. Todayˇ¦s low cost personal computers (PCs), together with new storage media, emerging communication networks and an increasing range of commonly available peripheral equipment and software, now present a highly cost-efficient means to incorporate text, sound and images into an integrated, interactive, mass communications medium.ˇ¨
ˇV from a technical assessment by The Orient Foundation (U.K.)
This
paper will focus on the potential interface between modern digital technology
and traditional Tibetan Buddhist culture. It
will not deal with the many ways that modern scholarship on Buddhism is being
affected by the digital revolution, although much could be said on this subject.[1]
Nor will it consider in detail how the new media technology is
facilitating the spread of Tibetan Buddhist knowledge and practice among people
living in societies that are not traditionally Tibetan Buddhist.
It will, in passing, address some issues that border on the field of
development such as transfer of technical knowledge, sustainability and local
capacity building via vocational training and employment opportunities.
This
is a critical time for Tibetan Buddhist culture.
Suffering under repressive regimes in the former Soviet Union, in
Mongolia, and in Tibet under communist Chinese rule, the traditional culture was
suppressed, often violently, for much of the 20th century.
This heritage, which originated more than 2,500 years ago, is now at the
brink of extinction. Most of the
cultural institutions, monasteries and libraries were closed or destroyed and
countless books and works of art were lost during the past seventy years. The surviving textual materials are scattered, uncatalogued
and deteriorating. The
traditionally trained masters, often displaced from their traditional
homelands, are ageing
and dying before they can pass on their knowledge and skills to the next
generation. A relative handful of
Tibetan refugees have struggled over the past forty years to re-establish their
culture outside of Tibet and transmit it to succeeding generations.
In Mongolia, Siberia and elsewhere, the living classical culture is
struggling to emerge from decades of political suppression and economic hardship
and, in addition, faces the challenges of modernization.
Against this backdrop, we come to the central question to be explored in this paper: How can the technology of the 21st century ˇ§information ageˇ¨ be effectively utilized to help preserve and strengthen Tibetan Buddhist traditions of learning and practice? And how can traditional Buddhist institutions selectively adopt and apply appropriate technologies to advance their core mission of training the next generation of master teachers?
In
considering these questions we should observe that, just as in the cases of
previous technological innovation such as writing and printing, modern digital
technologies have the potential both to aid and to dissipate the continuity and
vigor of the Buddhist traditions. However, aside from merely noting that modern technology has
the power to distract people from the ultimate aims of Buddhism, discussion of
the possible negative aspects of adopting technological innovations lies beyond
the scope of this paper. Nevertheless,
it may be instructive to relate a story that is told in Tibet:
Once there were two Buddhist monks who were progressing very well in their spiritual practices toward liberation and enlightenment. Seeing this, a demon became jealous and decided to interfere with the monks, to hinder their progress. Now these monks, in the course of their studies, were learning Sanskrit so that they could read the texts in that Indian language (from which the Buddhist scriptures in Tibet had been translated). The demon created a magical pen that had the power of writing Sanskrit, including the difficult rules of euphonic combination, called samdhi, perfectly and with ease. Upon finding this pen the monks were so enchanted by its powers, by what they could do with it, that they neglected their practice and were thus distracted from accomplishing their spiritual goals.[2]
This
story clearly warns of the dangers of getting distracted by the potential of the
technology itself and emphasizes the importance of maintaining a perspective
that sees technology as merely a tool for accomplishing oneˇ¦s real goals.
We
can see that the Buddhist tradition has embraced various technological
innovations in the past. Consider
that approximately 2,000 years ago the Buddhist teachings began to be recorded
in written form, marking a transition from what had been an exclusively oral
tradition.[3]
About 1,000 years later, the adoption of various forms of printing
sparked a revolution in the availability of Buddhist texts.[4]
Moreover, it might be argued that developments in Buddhist drawing,
painting, sculpture and even architecture represent evolving ˇ§technologiesˇ¨
for capturing spiritual understanding and visionary experience in physical
formˇXsimilar to the manner in which writing was (and is) used to capture
spoken discourse. Much more
recently, the development of photography as well as techniques for recording
sound and moving visual images, in short, the ability to capture information
from a particular event in time and replay it at another time and place, has
been revolutionary. Still more
revolutionary has been the development of broadcast communications, principally
radio and television, which have enabled millions of people to hear and see
events happening far away at the very moment when they are occurring.
There is no denying that all of these technological innovations have had an extraordinary impact on human affairs. Given the advanced state of analog technologies such as photography, audio recording, film and video recording, radio and television, we might ask what added benefits can be provided by digital technology. To help answer this question, let us begin by looking at some of the distinctive characteristics of digital technology.
Unlike
analog materials, digital content can be rapidly transmitted over networks and
duplicated onto inexpensive media without loss of information. Digital
materials can be interlinked in new interactive ways that transcend the
limitations of the original media types and they can be searched and otherwise
manipulated by automated processes far more easily than can analog materials.
Finally, and most significantly, the Internet enables multidirectional
sharing and communication of digital resources independent of time or location;
this stands in contrast to the one-way, locale- and time-specific
characteristics of standard broadcast communication technologies such as radio
and television. Disseminating
information over the Internet, via a continuously available web page, is an
affordable and highly cost-effective way to reach a large audience compared to
other modes of communication (audio and video tape, printed text and photos,
radio, television, postal mail, conventional publishing and so on) and adds the
capability of enhanced user interactivity.
In the context of Tibetan Buddhism, digital technology enables preservation of endangered cultural materials and traditionsˇXsuch as texts, artwork, recorded oral explanations, or videotaped demonstrationsˇXin a form that is portable and compact. Copies of digitized materials can be made quickly and easily at little expense, and with no loss of information from one copy to the next. Huge quantities of digitized materials can be widely distributed at very low cost: an entire library of books that would fill several large rooms can be distributed in digital form via inexpensive compact discs, or transmitted over the Internet in a matter of minutes. Digitized materials can be integrated in new ways by cross-linking text, images, sound recordings, and video. For example, a text passage explaining how to perform a Buddhist ritual can be linked to both an oral explanation of the passage and either still or moving images that demonstrate the ritual. Since digitized materials lend themselves to efficient searching and analysis, automated searches of digital text databases can be used to quickly and thoroughly locate specific passages or discussion on a given topic within a huge corpus of material. Citation searches that would require weeks of tedious effort using manual methods can be performed in just a few minutes.
What
are the various types of digital resources?
In a very general way we can make a division into two categories:
character-based (text) and non-character-based (still images, audio and
motion video). Digital textual
materials offer many advantages over printed books.
Electronic texts, catalogs, and dictionaries can be searched or indexed
in an automated and robust fashion. The
availability of a large searchable corpus of digitized texts enables
wide-ranging research across multiple works, efficient confirmation of citations
and bibliographical references, automatic comparison of alternate editions of a
work, and the use lexical analysis to draw more accurate conclusions about the
authorship, geographical origin, and relative chronology of texts.
And since size (page count) is no longer a limiting factor in digital
materials, reference works such as catalogs and encyclopedias in electronic form
can be far more detailed and extensive than their printed counterparts.
In
contrast to character-based digital materials, non-textual materials such as
images, audio, and motion video cannot be automatically searched and
manipulated. Consider the example
of a digital image of a page from a rare book: the information it contains is
not searchable unless further processing, such as optical character recognition,
is applied. And we can easily see
how this limitation extends to digital images of artwork or digitized audio
recordings of oral commentaries. As a way to partially overcome this limitation, those who
assemble large collections of digital images and audio/video resources typically
create extensive and detailed catalogs of their materials; these character-based
catalogs can then be searched, indexed and otherwise manipulated.
Unlike
traditional printed books, digitized materials need not be restricted to a
sequential format. As exemplified
by the web pages of news organizations such as Reuters, BBC and CNN, digital
text can be extensively annotated and linked with images, audio, video and other
passages of text without disrupting the flow of the narrative.
Early
efforts to apply digital technology to the work of preserving and disseminating
Tibetan Buddhist culture began soon after the introduction of the personal
computer. As early as 1983,
researchers from The Asian Classics Input Project (ACIP) created a Tibetan word
processor for the IBM-PC computer and subsequently developed a methodology for
digitizing Tibetan language materials. This
methodology has since been used to create a virtual library of classical Tibetan
Buddhist literature totaling more than 150,000 pages.[5]
There
were a number of challenges facing ACIP when they began this work.
Because of the difficulties involved in deciphering the woodblock-printed
texts, it became clear that the work would be best accomplished by fluent
Tibetan-speakers. Furthermore,
because of the massive amount of the target literature to be input, and with
only modest available funding, ACIP developed a scaleable and cost-effective
methodology: training and employing Tibetan refugees in India to type the texts
using low-cost personal computers.
Due
to both the lack of support for display of Tibetan script in early computers,
and the need to produce digital materials in a universally useable format, the
texts are entered in roman-letter transliteration.
In order to reduce errors during input, each text is typed by two data
entry operators and then an automated comparison program is used to identify and
correct any differences between the two typed versions.
As a happy by-product of this methodology, the data entry operators
acquire a marketable skill since they gain experience in the use of computer
equipment while learning to touch type on a standard (QWERTY) keyboard.
In
1990, when ACIP began public distribution of the digitized texts to libraries,
researchers and cultural institutions, the standard computer media was the
5.25-inch diskette, capable of holding up to 600 pages of text.
By 1993, when the text database had grown to a size where distribution on
diskette was impractical, the CD-ROM drive had become standard equipment in
personal computers. ACIPˇ¦s 1993
CD-ROM release, entitled Woodblock to Laser, totaled some 70 megabytes of
data, or 35,000 pages of text. With
the advent of the Internet, and especially the WorldWideWeb (in 1995), huge
quantities of these digitized materials began to be widely distributed at
virtually no cost.
ACIP-trained
cataloguers have been working since 1994 with library staff at the Institute of
Oriental Studies (Russian Academy of Science) and the Oriental Library of the
University of St. Petersburg to create comprehensive electronic catalogues
of the important and extensive collections of Tibetan materials housed in St.
Petersburg. Each
title in the collections is treated as a separate work in its own right, and
complete cataloging information is given for each one.
Since the catalog is computerized, it is possible to include a level of
detail not achieved in any previous printed catalogs.
Therefore, a total of twenty-two different details of each title are
recorded, including the entire colophonˇXcontaining composition and printing
information along with valuable information on the Tibetan teaching lineages and
other historical information.
Over
80,000 separate titles have been cataloged to date, with work projected to
continue for another five years or more. ACIP has made a commitment during this time to assist the
Tibetan Collection of the Institute of Oriental Studies in setting up
reproduction facilities that will allow the Institute to fulfill orders from
interested persons for copies of texts they need to work with. It is hoped that the existence of this reproduction facility,
and the fees paid to the Institute by scholars for copies, will help defray the
considerable cost of maintaining this precious treasure for future generations.
In
Ulan Bator, Mongolia, ACIP has recently started similar cataloguing projects in
cooperation with the Mongolian State Library and the national monastery of
Gandan Tekchen Ling. The huge[6]
and relatively inaccessible collections of Tibetan Buddhist literature housed in
Mongolia are thought to contain many texts that exist nowhere else in the world.
Although
early use of the digital materials was limited to those with access to a
personal computer, primarily researchers, translators and scholars in the
industrialized nations, by 1996 Tibetan refugees had started a revolution in the
availability of Tibetan literary materials by using the digital texts as the
basis for computerized typesetting of printed textbooks.
As a result, the number of Tibetan titles being published each year by
the relocated monasteries in South India has increased more than tenfold from
what it had been before the introduction of digital technology (when the books
had to be set by hand in metal type). Beyond
the increase in the number of titles available, the books typeset by computer
are more legible and typically contain fewer typographical errors than previous
editions. Recognizing the
significance of this new use of digital technology, and also the urgent need for
providing opportunities in vocational training and employment to young Tibetans
entering the workforce, Tibetan resettlement officials have allocated funds for
the construction and equipping of new data entry centers in the refugee
communities.[7]
Up
to the present, this has been the extent of benefit from digitization for people
in the Tibetan cultural sphere; there are still significant barriers to their
taking full advantage of the digital revolution such as the lack of access to
networked computers and the lack of standardized support for computing in
Tibetan. As we shall see, these
barriers are just now beginning to be overcome.
As
we ponder the question of how digital technology might be creatively applied to
help alleviate the very real threats to the survival of Tibetan Buddhist
civilization as a vital and living cultural tradition, we should take a moment
to consider some of the relevant characteristics of traditional Tibetan Buddhist
culture.[8]
There
can be little doubt as to the importance of this cultural heritage.
The vast majority of Tibetan Buddhist literature can be traced back to
classical Indian sources, as exemplified in the Kangyur
and Tengyur, two collections of more than 300 large volumes that were
carefully translated into Tibetan, primarily from Sanskrit, by
generations of scholars. The
ideas and cultural values contained in these texts have shaped Buddhist
civilization throughout Asia for two and a half millennia.
Many of the original Sanskrit texts,
including records of teachings by the historical Buddha (circa 500 b.c.e.),
have since disappeared and now survive only in
their Tibetan translation. During
the past thirteen centuries, native Tibetan and Mongolian authors have written tens
of thousands of works based on these two classical collections and
have thereby created an extraordinary
body of knowledge that spans a wide range of
topics including: logic and perceptual theory, epistemology, hermeneutics,
medicine, astrology, ethics, psychology, metaphysics, poetry, rhetoric,
government, meditation techniques, architecture and the arts.
Traditional systems of Tibetan Buddhist education are founded on
intensive study and memorization of these classical materials.
Moreover,
the Tibetan Buddhist traditions have preserved unbroken lineages of practice and
teaching that stretch back to the earliest beginnings of Buddhism.
For many hundreds of years, until the middle of the 20th
century, the monasteries of Tibet were active centers of intense philosophical
study and debate, meditation, and the arts. Each
generation gained mastery of the essential spiritual knowledge through
face-to-face giving from teacher to student.
But
the sad events of the past seventy years have come close to erasing Tibetan
Buddhist culture from the face of the earth.
Beginning in the 1930s in the former Soviet Union and Mongolia, Tibetan
Buddhist traditions were suppressed, the monasteries and libraries were
destroyed, and the monks and lamas were either killed or forced into lay life.
Two decades later a similar fate befell Tibet with the result that over
100,000 Tibetan refugees now reside outside of Tibet.
As
we begin the 21st century, the entire Tibetan Buddhist cultural
sphereˇXwhich includes Kalmyk-Mongol lands bordering the western shore of the
Caspian Sea, vast regions of the Siberian and Mongolian steppes, the Tibetan
Plateau and much of the HimalayasˇXis facing a multitude of physical, political
and economic challenges. Both the
surviving cultural materials, such as the books and artwork, and also the
instruction lineages, which have passed unbroken from teacher to student for
more than two thousand years, are severely endangered.
Communities in the region are separated from one another by both physical
distance and political boundaries while refugees struggle to maintain their
Tibetan Buddhist culture in their diaspora throughout South Asia and around the
world. Whether located in their
traditional homelands or in refugee camps, all of these Tibetan Buddhist
communities are currently cash-poor. And
because Tibetan Buddhist civilizations have historically had economies founded
primarily on subsistence-based agriculture and livestock herding, they are
having difficulty adapting to the modern global economy.
So how are these characteristicsˇXof being important, endangered, fragmented and widely spread, and cash-poorˇXrelevant to the application of digital technology? As shown in the examples given earlier, digital technology can be effectively used in the preservation and dissemination of endangered cultural materials. Further, modern global communications eliminate many of the barriers of physical distance by enabling people in widely separated locations to access digital resources and interact with each other in the ˇ§virtualˇ¨ time and space of the Internet. And while the costs of new media technologies have been steeply falling over the past decade, the speed, capacity and functionality of information and communication technologies have increased dramatically.
As
information and communication technologies take on an ever-increasing role in
human affairs, there is growing concern about the widening gap between the
technological haves and have-nots.[9]
However, there is reason to believe that the ˇ§digital divide,ˇ¨ which
is particularly evident in the societies of the Tibetan cultural sphere, may
soon be bridged. There are three
major reasons for the fact of the digital divide in these communities.
Because of the lack of cash, on both a per capita and an aggregate basis,
there has been very little investment in computer equipment and connectivity,
especially when we take into account that the cost of computers, digital storage
media and telecommunications was recently 3 to 20 times higher than at present.
Stemming from this lack of technology infrastructure investment, there
has been little economic (or even political) incentive for the creation of
standards for representing Tibetan-script materials within digital environments
nor for the development and implementation of robust software applications for
working in Tibetan. Then, on the
basis of these two circumstances, there has been scant opportunity or incentive
for the creation of digital resources in Tibetan language.
Furthermore, due to the lack of an installed base of computers and other
digital technologies, people living in Tibetan Buddhist societies have had
little chance to develop technical expertise or experience, much less to find
employment within a digital technology field.
Let
us take the example of a high school student in an English speaking country such
as the United States who wishes to write a paper on a particular topic of
American culture or history. He or she can very likely use a computer with Internet access
at home or, barring that, at a nearby school or library. The menus and messages in both the computer operating system
and the applications for word processing and web browsing will, naturally, be
displayed in English. The student
will find a large number of websites containing information, in English, that
can be used in writing the paper and he or she can also use the Internet to
learn about (and even to obtain) sources of further information, such as titles
in the collections of local or regional libraries or relevant books for sale at
online booksellersˇXall in English.
In
contrast, consider the plight of a high school student in the Tibetan cultural
sphere who wishes to write a paper on some aspect of Tibetan culture.
Even if he or she can find a computer to use, the menus and messages will
not be displayed in Tibetan. Gaining
access to the Internet will likely pose another challenge, although the recent
advent of ˇ§Internet cafesˇ¨ is beginning to alleviate this particular
obstacle. The obstacle of language is seen again in both the lack of
built-in support for Tibetan in current web browsers and the resulting dearth of
Tibetan-language web pages. Viewing
any such sites requires the user to download and install special fontsˇXwhich
are usually specific to a particular operating system: Windows, Macintosh,
Linux, and so on and, furthermore, are mutually incompatible.[10]
With only a few websites to choose from, there is little chance that the
student will find information of the type being sought.
Thus we can see why some Tibetans consider the lack of computer support
for Tibetan language one of the greatest threats to the continuity of Tibetan
culture, especially among the youth.
But all this is about to change. With the recent steep plunge in the costs of computer equipment and connectivity, many of the monasteries, schools, and other institutions in the Tibetan cultural sphere are beginning to connect into the global information network. And with the establishment of the Unicode (ISO-10646) standard[11] for representing all of the worldˇ¦s languages within a single character-encoding scheme, we can anticipate widespread support for Tibetan script within computer environments, including word processors and web browsers.[12] The increasing number of computers tied into the Internet in Tibetan-speaking regions, in combination with built-in support for using Tibetan on those computers, will undoubtedly stimulate both the creation and utilization of digital resources related to Tibetan Buddhist culture.
The universal adoption of the CD-ROM drive as standard equipment in personal computers, coupled with the fact that a well-equipped multimedia computer now sells for approximately US$2000 has brought about entirely new possibilities for creating, distributing and viewing digitized cultural materials.[13] A single inexpensive CD can provide durable and universally accessible storage for up to 100,000 pages of multilingual text, 40 hours of voice, 10 hours of music, 100 high-resolution full color photographic images, or 45 minutes of full-screen film or video.[14] At the same time we have witnessed stunning advances in the bandwidth[15] of connections between individual workstations and the Internet; whereas a connection speed of 14.4kbps was previously the standard transmission rate over copper telephone wires, data transmission speeds near 56kbps are now common. So-called ˇ§broadbandˇ¨ connections such as ISDN and especially cable, satellite and DSL services provide transmission speeds that are typically about twenty times faster than 56kbps dial-up connections, at reasonable cost.[16]
A
number of factors need to be in place before digital technology can be
effectively applied to the task of recording and providing useful access to
classical Buddhist materials within the Tibetan cultural sphere.
First, universal standards for the various media types (texts, images,
sound, motion video) must have been established and adhered to so that the
digitized cultural materials can be stored, used and re-transmitted without the
danger of corruption, incompatibility or near-term obsolescence.
Second, following on the adoption of standards, support for those
standards in operating systems and applications programs must be implemented.[17]
Third, computers that incorporate standardized support for all relevant
media types, along with Internet connections of sufficient capability, must be
installed in Tibetan-speaking communities.
Fourth and finally, people in the local population must be trained in how
to manage and use the new technology.
Standardized
support for digitized images, sound, and motion video is now well established
across multiple operating systems, largely due to the pervasive influence of the
Internet in general and the WorldWideWeb in particular.
Uncompressed TIFF (Tagged Image File Format) and compressed JPEG/JFIF[18]
are widely supported de facto standards for digital images.
A number of standards for digital sound and motion video have become
widely adopted and supported within computer environments, starting with the
ˇ§Red Bookˇ¨ audio standard used for common audio (music) compact discs.
Several non-proprietary standards for audio/video have been established
by the Moving Picture Experts Group (MPEG)[19]
such as the MPEG-2 video compression format used for standard DVD video.[20]
The wildly popular MP3 format for compressed digital audio is officially
known as MPEG Audio Layer 3.[21]
Meanwhile,
the lack of a standard way to represent Tibetan-script materials electronically
has been a major obstacle to the spread of computerization within the Tibetan
cultural sphere. It has also been a
persistent irritant to researchers in the field of Tibetan Studies who have
argued about the relative merits of one proprietary system over another without
reaching any resolution. Non-Tibetan
researchers and academics have generally favored using one or another system of
transliteration into roman letters[22]
whereas most Tibetan users have preferred to use one of the proprietary software
programs that display Tibetan materials directly in native script using special
fonts.[23]
Because each of these different Tibetan programs have their own way of
representing Tibetan in the computer, people working with Tibetan-script
materials are often unable to share their work with others, and even basic
written communication in TibetanˇXsuch as via emailˇXis problematic.
With the exception of electronic documents that represent Tibetan using
plain-text roman letters (as in the ACIP system), digital Tibetan material
cannot easily be ported between operating system platforms[24]
and interoperability of Tibetan data stored in different formats, even on a
single platform, is virtually unknown.[25]
The
Unicode standard for representing multilingual text in digital environments
finally solves the problem of how to share Tibetan-script materials between
different applications and different operating system platforms.
This fact was recognized during a recent conference,[26]
attended by many of the major producers of both digital Tibetan texts and
Tibetan-script computer software, where the participants agreed to adopt Unicode
as the standard format for interchange of Tibetan materials.[27]
This decision represents a departure from past practice of using
roman-letter transliteration as the common import/export format for Tibetan
materials. It also provides a
framework within which individual users can continue to work in their current
computer environmentsˇXmost of which do not yet support display of Unicode
Tibetan and hence must employ one or another proprietary scheme for displaying
Tibetan scriptˇXwhile ensuring that their work can be shared with others and
can be used in future computer environments that support Unicode Tibetan.
Beyond Unicode, a second significant development is seen in the area of standards that define the general format of documents. To the extent that the Internet allows users in different locations, using different computer equipment and having different interests to share digital resources, it is essential that standardized formats are defined and adhered to. For example, when multiple independent institutions engage in the cataloguing of Tibetan materials, their work will be most useful to the wider community if they follow a common cataloguing format since standardized catalogue records can be accessed through a variety of means and can be easily unified.[28] The 1990s saw the establishment of a number of important formatting standards. Most prominent among these has been HTML (HyperText Markup Language). Although HTML is merely one instance of SGML (Standard Generalized Markup Language), as the driving force behind the WorldWideWeb it has revolutionized the Internet and the manner in which we now access and share information. A further instance of SGML, called XML (eXtensible Markup Language), builds upon the success of HTML while going beyond its limitations. Especially important for our purposes is XMLˇ¦s ability to explicitly define both the structure and the content of the data, enabling the data to stand independent of any particular application or context. And because it fully incorporates Unicode, XML represents the first truly practical standard for the preservation and dissemination of marked up Tibetan-script materials in a durable, portable and interoperable digital form.
Although
we might assume that worldwide adoption of the Unicode standard necessarily
entails that Tibetan (and all other scripts) will be supported in computer
environments, in fact the definition of the character set for Tibetan script in
Unicode is only one essential ingredient in the recipe for standardized support
for Tibetan. The other ingredients
include rendering (displaying the Unicode characters in Tibetan script),
keyboarding (defining how to create the various letters and letter combinations
when typing) and string ordering (used when sorting words in dictionary order).
Ancillary ingredients include converters from legacy Tibetan script
formats to Unicode, lexicons for use by automated spell-checkers, and so on.
The
Dzongkha Development Commission in Bhutan and The Orient Foundation (U.K.) have
been working for a number of years to specify and develop all of the various
tools that are required to support Unicode-based computing in Tibetan-script
languages such as Dzongkha, the national language of Bhutan.[29]
Wherever possible these Unicode components are defined within the
framework of open international standards so that they can be readily
implemented across a variety of operating systems and platforms.
Dzongkha-specific tools for rendering, keyboarding, string ordering, and
conversion of legacy data are now nearing completion.
With these essential components in place we can reasonably hope to see
standardized support for Unicode Tibetan within the next year or so.
Once
Unicode support for Tibetan is widely implemented we can anticipate some
immediate benefits. At present,
proprietary (and incompatible) font-encoded formats are the most suitable
formats for publishing printed Tibetan books but they are the least suitable
formats for searching, indexing, interchanging data, manipulating information in
databases and spreadsheets, and so forth. Roman
transliteration, although an excellent basis for searching, interchange and
other generic processing, must be converted to Tibetan script prior to
publishing. After the data is converted from roman letters to Tibetan
letters, care must be taken that any corrections made during proofreading of the
Tibetan-script version are subsequently reflected in the roman-letter version, a
step that is often overlooked. Furthermore,
most formatting information that is added during the process of laying out the
text for publishing cannot readily be mirrored in the roman-letter version.
In contrast, support for Unicode Tibetan will mean that all text data
used during computer typesetting of books for publishing will be immediately
available in a format suitable for integration into a searchable text database,
and vice versa. In short, Unicode
support will enable all digital Tibetan materials to be directly readable,
searchable and printable in Tibetan script, with any corrections or added
formatting applied to the digital text itself rather than to a derivative
version.[30]
Responding
to the urgent need for securing and revitalizing Tibetan Buddhist culture, four
of the leading Tibetan cultural conservancy organizations joined together in
February 2001 to form the Tibetan Knowledge Consortium.[31]
Over the past two decades the individual consortium members have
developed relationships with the prominent lineage holders and have established
cultural documentation programs with more than forty monasteries, libraries and
other institutions throughout the Tibetan cultural sphere.
In collaboration with its affiliates and partners in Asia, the consortium
is now leading an initiative to collect, conserve, integrate and provide
universal access to the extant literary and living cultural traditions of
Tibetˇ¦s endangered classical heritage. This
initiative, entitled The Classical Tibetan Knowledge Program (CTKP), utilizes
the above-mentioned international standards and recent advances in digital
technology to implement far-reaching programs on a relatively modest budget.
As
a necessary foundation for undertaking the mission of the CTKP, networked
multimedia computers, digitizing equipment, digital storage media and other new
media technologies are being installed at 30 major libraries and monasteries in
the Tibetan cultural areas. Locally
based librarians and other community members are trained in management of these
facilities and utilization of the digital resources.
Training and employment opportunities are also provided in the key tasks
of digitizing Tibetan Buddhist cultural materials via:
ˇP
Creating
comprehensive electronic catalogues of surviving Tibetan literature.
ˇP
Inputting
the contents of traditional woodblock printed texts to create a searchable
digital library of classical Tibetan texts.
ˇP
Using
digital technology to create annotated critical editions and refined English
translations of key Tibetan works.
ˇP
Recording
and digitizing oral commentaries given by senior lineage holders and master
teachers.
ˇP
Producing
visual documentary records relating to religious instruction and rituals,
medicine and the arts, using digital video technology.
Once
standardized support for Tibetan is widely available, the digitized cultural
materials will be collected together, integrated and cross-linked within an
online multimedia repository that can be accessed via the Internet from anywhere
in the world.[32]
In this way, digital technology enables endangered texts and instruction
lineages to be recorded and re-transmitted across the barriers of time and
physical distance at very low cost.
Other
significant features of this initiative:
ˇP
Priorities
for cultural preservation are determined in consultation with senior lineage
holders.
ˇP
Monasteries
and other traditional institutions are offered guidance and technical assistance
in using digital technology to further their overall mission of transmitting
knowledge and skills from one generation to the next.
ˇP
Technical
knowledge is transferred to the local communities so that program management and
development can be devolved to indigenous institutions in the regional area.
ˇP
Portals
to the online resource repository enable Tibetan Buddhist groups and Tibetan
cultural organizations worldwide to make their own digital resources available
to others on the CTKP website.
ˇP
A
comprehensive online directory of worldwide Tibetan cultural materials,
maintained by the consortium, can be used to locate sought-after resources.
ˇP
Local
educators are trained in the authoring of sophisticated multimedia instructional
modules.
ˇP
A
database system for cataloguing Tibetan cultural materials is being developed
using non-proprietary standards-based tools for multilingual computing.[33]
ˇP
Infrastructure
development and training programs promote enhanced communication (via email),
dissemination of cultural information (via upload to, and download from, the
website) and the growth of ˇ§information communitiesˇ¨ (such as mail lists,
chat rooms electronic bulletin boards and news groups) among people and
institutions throughout the Tibetan cultural sphere.
ˇP
Digital
resources in English translation, made available to the general public, can help
people everywhere gain a deeper understanding and appreciation for the
philosophical, artistic and cultural traditions of Tibetan Buddhist
civilization.
ˇP Essential source materials in digital form, made available to international scholars, researchers, libraries and academic institutions, serve as the basis for contemporary scholarly research, analysis and the preparation of authoritative translations and instructional materials.
We
have already seen some examples of how the availability of these digital
resources, along with the tools and infrastructure needed to support interactive
computing in Tibetan, will help stimulate revitalization of traditional Tibetan
Buddhist culture within the indigenous societies.
Digitized audio and visual recordings of the living instruction lineages
serve as the basis for study, teaching, and the preparation of multimedia
instructional modules while digitized texts are used as the basis for
invigorated book publishing in Tibetan communities across South Asia.
Furthermore,
taking Tibetan literary materials as an example, the availability of the books
in digital format (whether as scanned images or typed input), assures that
teachers and students will have access to needed texts.
Teachers can select and extract passages from a huge corpus of material
and can, on demand, print just the desired pages in the exact quantities needed,
such as for teaching a class. Because
a digital library is highly compact and portable (it can fit on just a few CDs)
and because individual texts can be easily transmitted via electronic means,
teachers will rarely lack a text they needˇXwhether as the basis for expanded
oral explanation or for conducting a particular Buddhist practice or ritual. We can imagine a scenario where a lama travels to a
particular place in order to give a teaching and then learns at the last minute
that the scheduled teaching has been changed and he will need a book that he did
not bring along with him. Or
perhaps a statue needs to be consecrated or a particular ritual needs to be
performed and the lama lacks the required texts in printed form. It is easy to see how the availability of texts in digital
form, either on CD or via the Internet, might be very helpful in such
circumstances.
The existence of comprehensive and searchable texts and reference works will facilitate the creation of thorough and accurate translations of Tibetan Buddhist books into other languages. These translations, in turn, will not only help to bring the seminal Buddhist values of non-violence, wisdom, compassion and ethical self-restraint into the wider world but will also be useful to Tibetans who wish to teach Buddhism in languages other than Tibetan.
Essential
though it may be, access to digitized books and other cultural resources is not
sufficient to save the living culture.[34]
The living culture is embodied in the people and especially in the highly
trained masters. To fully
appreciate this point we need to recognize what it means to be a master, to
possess true mastery of a particular body of knowledge or technique.
Let us consider the example of mastery in the field of music.
Although both masters and students might benefit from the availability of
sheet music, merely preserving the musical scores cannot, by itself, preserve
the living tradition of musical mastery. Similarly, preserving the musical instruments, the concert
halls, and even the recordings of masterful performances is not sufficient to
preserve the living tradition. Mastery
is gained in the context of the interaction between teachers and students, in a
disciplined manner, over the course of many years. The existence of schools and training academies is thus a
principal requirement for the continuity of these living traditions of mastery.
Whether we are speaking about master artists or musicians, masters of the
martial arts, or meditation mastersˇXtheir knowledge, experience and personal
development can never be fully captured in words, or even on film or videotape. Mastery is lived experience and as such can only be
transmitted from teacher to student through direct instruction and training.
Thus,
in order to truly maintain the continuity of this 2,500-year-old tradition, the
monasteries and other training institutions must be supported both materially
and technically. Whatever attempts
are made to advance Tibetan Buddhist culture must therefore incorporate the goal
of ensuring that there continue to be qualified master teachers (lamas) in the
world who are able to correctly teach the ancient wisdom that is recorded in the
books. Without qualified teachers,
the books themselves are practically useless, no matter how perfectly they are
preserved. The true goal of the books, in conjunction with the oral
instruction lineages, is to produce a special kind of person: ethical, mindful,
wise, compassionate, skilful, logicalˇXin short, a person who possesses mastery
of these inner arts.
We
have seen that the need for support of traditional Tibetan Buddhist culture is
urgently felt and can be addressed in certain ways through the creative
application of digital technology. At the same time, whatever support is given should lead to
the creation of sustainable infrastructure, programs and services.
In particular, there must be investment in training the indigenous people
of the Tibetan cultural sphere, to build local capacity for carrying forward
these important cultural revitalization activities.
The CTKP, for example, has specific plans within its overall program for
devolving development and management of its programs and facilities to the
regional communities and institutions.
Channels
must be established that enable technical, financial and material support to
flow into the monasteries and other cultural institutions within Tibetan
Buddhist societies. Members of the
Tibetan Knowledge Consortium have been successful in obtaining material support
in the form of donations of new and used computer equipment and computer
software from corporate sponsors. Private
foundations and individual donors have contributed generously to the monasteries
to support housing, scholarship and food funds, and also sanitation, water and
health projects. As these channels
become well established, fund raising and administration can be devolved to the
local institutions themselves.
The availability of digital Tibetan Buddhist resources can also play an important role in building self-sufficiency within the monasteries and other classical Tibetan institutions. Beyond the obvious merit for both teachers and students, the teaching of Tibetan Buddhism in languages other than TibetanˇXfacilitated by digital resources and reference materials that support translation activitiesˇXwill enable a wider audience to appreciate the importance of the monasteries and other traditional institutions where the teachers receive their training. As people learn about the monasteries, mainly through direct contact with the skilled teachers who come from them, they can begin to recognize these institutions as the ˇ§fertile ground for growing masters of the inner artsˇ¨ and, thereupon, take an interest in their continued existence. Also, to the extent that the digital resources are seen as useful or valuable in the modern academies of the industrialized nations, it is feasible to charge subscription fees along with access and copy fees. These fees can then be used to help support the digital preservation work in particular as well as the overall programs of the classical Tibetan institutions that perpetuate the living cultural tradition.
[1] See, for instance, Christian Wittern, ˇ§Buddhist Studies in the Digital Age,ˇ¨ Chung-Hwa Buddhist Journal, No. 13.2 (May 2000), pp. 461-501.
[2] This story was related by Sermey Khensur Lobsang Tharchin (1921ˇV ), currently resident teacher at Rashi Gempil Ling Kalmyk-Mongolian Buddhist Temple in Howell, New Jersey, USA.
[3] We can easily imagine that there were critics 2,000 years ago who viewed written scriptures as poor substitutes for the knowledge that is held in a personˇ¦s heart and mindˇXthat is, the teachings that one has committed to memory and can thereby contemplate and apply throughout the day.
[4] The technique of printing from carved woodblocks dates back to at least the 8th century. The earliest known dated blockprint, although well advanced in technique, is a scroll of the Diamond-Cutter Sutra (Vajracchedika) in Chinese translation from 868 c.e. As this particular work richly demonstrates (see http://portico.bl.uk/diglib/treasures/ diamond-sutra.html), printing was used not only for text (words) but also for ornate drawings, prayers and mantras (such as one finds on prayer flags or placed within prayer wheels or stupas) and so forth.
[5] For a survey of Tibetan Buddhist digital resources available from ACIP and other sources, see Robert R. Chilton, ˇ§Impact of Digital Technology on Tibetan Buddhist Scholarship and Instruction,ˇ¨ presented at the International Conference on Buddhism and the 21st Century Digital Information Society, celebrating publication of the Koryo Tripitaka CD-ROM by the Research Institute of Tripitaka Koreana (Dongguk University, Seoul, December 7ˇV9, 2000).
[6] We estimate that there are approximately 1.4 million titles in the two collections housed in Ulan Bator, although a good number of these titles are, no doubt, duplicates.
[7] During a recent visit to one such data entry center, a Tibetan resettlement official remarked that the employment offered at the input center ˇ§helps to keep families togetherˇ¨ˇXan important outcome for people living in the settlements.
[8] It might be argued that the traumatic events of the 20th century have created a situation for traditional Tibetan Buddhist societies wherein digital technologies will play a more important role than would be the case had the Buddhist teaching lineages and traditions been left undisturbed.
[9] In a speech in Geneva in October 1999, UN Secretary-General Kofi Annan warned of the dangers of the global digital divide: ˇ§People lack many things: jobs, shelter, food, health care and drinkable water. Today, being cut off from basic telecommunications services is a hardship almost as acute as these other deprivations, and may indeed reduce the chances of finding remedies to them.ˇ¨ (quoted in a BBC News special report, October 14, 1999). One response to this concern can be seen in the creation of the Development Gateway Foundation, which was founded in 2000 as a public-private partnership aimed at bridging the digital divide and using information and communication technologies for poverty reduction and sustainable development. The DGF seeks annual contributions from governments, NGOs, private industry and other donors of US$30ˇV50 million to support its programs.
[10] Due to the absence, in the past, of a universal encoding standard for Tibetan, display of data in Tibetan script has necessitated using one or another of the various proprietary Tibetan fonts. Since each of these fonts has its own way of encoding the Tibetan characters, a user must have access to the particular Tibetan font(s) used in a document in order to view it in Tibetan script.
[11] The Unicode Consortium is a non-profit group established in 1991 and composed of academics and experts from the major computer hardware and software manufacturers. Tibetan script first appeared in Unicode version 2.0 (1996) and was expanded in version 3.0 (2000). The Unicode Standard has been ratified by the International Standards Organization as ISO-10646-1: ˇ§Information technology -- Universal Multiple-Octet Coded Character Set (UCS) -- Part 1: Architecture and Basic Multilingual Planeˇ¨. Efforts are now underway to support Unicode-based Tibetan within a variety of computer operating systems and applications.
[12] Unicode is now recognized by the information technology industry as the standard character encoding scheme for multilingual computing. For example, Microsoftˇ¦s Windows NT, 2000 and XP operating systems use Unicode (UCS-2) internally for all text handling. Full Unicode support for many Asian languages is available in the Asian versions of the Microsoft Office suite of applications.
[13] Until recently, a high-end multimedia workstation capable of digitizing and editing audio-video materials might cost anywhere between US$8,000 and US$30,000.
[14] Whereas a CD can store up to 650 or 700 megabytes of data, the next generation of compact optical disc, called DVD or Digital Versatile Disc, can store 4.7 gigabytes of data on a single-sided, single-layered disc.
[15] Although the term ˇ§bandwidthˇ¨ has certain technical meanings for engineers, from the end-user perspective greater bandwidth simply means that more data can be transmitted in the same amount of time, i.e., a higher data rate or ˇ§speedˇ¨.
[16] While ISDN provides a transmission rate of 128kbps, typical DSL and cable services currently provide ˇ§downstreamˇ¨ transmission rates of between 200kbps and 3000kbps, with rates for satellite services somewhere in the middle of that range. Because of the high data transmission rates they provide, a single broadband connection can be shared among multiple workstations, thus distributing the cost of the connection service between multiple users.
[17] As will be described below, whereas standardized support already exists for the creation and display of images, motion video and such, standardized support is still in the process of being developed for the creation and display of Tibetan-script materials.
[18] JPEG stands for the Joint Photographic Experts Group, a group of digital image experts chosen by national standards bodies and major vendors. The official title of the group is ISO/IEC JTC1 SC29 Working Group 1. The commonly used JFIF (JPEG File Interchange Format) incorporates the best known standard (ISO-10918-1) produced by this group.
[19] Officially known as ISO/IEC JTC1 SC29 Working Group 11.
[20] MPEG-2 (ISO-13818) has been adopted as the de facto standard used by major television and cable companies for relaying digitized moving images. A more highly compressed and lower quality format, based on the standard known as MPEG-1 or simply MPEG (ISO-11172), is used for transmission of moving images over the Internet and on Video CDs (which have lower capacity than DVDs).
[21] MP3 format is defined under ISO-11172: ˇ§Information technology -- coding of moving pictures and associated audio for digital storage media at up to about 1,5 Mbits/s -- Part 3: Audioˇ¨.
[22] Unfortunately, even on this point there is no general consensus since there are at least three or four different roman-letter transliteration systems (plus minor variants) in widespread use. The ACIP transliteration system has attained a semi-de facto status as a standard format for interchange of full-text Tibetan documents due to the amount of data distributed in that format and the consequent support provided for ACIP-format files by the various proprietary Tibetan software programs.
[23] A brief overview of some of the most commonly used software programs for computing in Tibetan can be found in Chilton, ˇ§Impact of Digital Technology on Tibetan Buddhist Scholarship and Instruction,ˇ¨ cited above.
[24] For example, porting from Windows to Macintosh or from OS/2 to Unix.
[25] A notable exception is seen in The Document Processor (TDP) written by Leigh Brasington and available for free download from http://www.geocities.com/potalasw/. TDP has the ability to read and integrate Tibetan documents created in a number of the most common formats and can also export Tibetan documents in various formats including Unicode.
[26] ˇ§Digital Tibet: Tibetan Studies and the Internetˇ¨, hosted by the University of Virginia, a leading research institution in both the field of electronic text and the field of Tibetan Studies, held on 5ˇV7 April 2001.
[27] More precisely, the format adopted is UTF-8, a ˇ§transformation formatˇ¨ of Unicode into single byte (8-bit) characters that is highly portable between various computer environments including those that do not support multi-byte characters.
[28] One of the major objectives of the University of Virginiaˇ¦s ˇ§Digital Tibetˇ¨ conference was to establish a forum through which common standards for cataloguing Tibetan materials could be developed.
[29] Dzongkha can be viewed as a superset of classical Tibetan since Dzongkha incorporates the entire classical Tibetan language within its grammar and lexicon.
[30] However, it should be noted that in order for the formatting to persist between different users and applications, the formatting markup employed should adhere to one of the widely recognized standards such as HTML, XML, or RTF (Rich Text Format). The main criterion here is that the formatting tags should be composed of plain text characters rather than the binary-level markers used in proprietary formats such as those native to commercial word processing applications.
[31] The four consortium members will continue their established cultural conservancy programs while working to ensure easy integration of all materials produced. The four founding consortium members and their areas of responsibility:
1. The Asian Classics Input Project (USA) ˇV locating, cataloguing and inputting of surviving textual materials.
2. The Institute of Tibetan Classics (Canada) ˇV editing and translation into English of key classical texts.
3. The Orient Foundation (UK) ˇV audio-video recording of oral explanations and artistic traditions; cataloguing of recorded Tibetan cultural resources worldwide.
4. The Meridian Trust (UK) ˇV recording, cataloguing, and digitization of Tibetan cultural traditions on film and videotape.
Consortium partners and affiliates will also undertake a program of text acquisition and digital scanning of rare and endangered texts. A technical development team composed of key consortium staff will direct the onsite technical implementation and training component and also the development of an integrated online digital resource repository (www.TibetanKnowledge.org).
[32] In areas where Internet access is limited, the digital resources can also be made available for local use via periodic distributions of the data on high capacity storage media such as CD, DVD, or digital tape.
[33] As mentioned previously, consortium members are taking a leadership role in the creation of standardized support for Tibetan within multilingual computing environments.
[34] And it might even be argued that saving Tibetan Buddhist culture is not in itself the only goal; that the greater goal extends to the ˇ§advancement of real peopleˇXto learn, and preserve, and to pass on to others the great ideas that will help and uplift humankind; and even, if we are to believe the authors of these very books, to actually reach a place which is beyond all suffering and the state of death itselfˇ¨ (ACIP Release IV: A Thousand Books of Wisdom, User Manual, 1998, page 195).