Terminology
infrastructures in support of the terminology market in Europe
Christian Galinski, Infoterm. |
![]() |
|
Abstract: There
seems to exist a vexing relationship between the possibility of using
one's mother tongue and the positive economic development of the respective
language community. People whose mother tongue is not (or not sufficiently)
developed from the point of view of specialized languages or who are denied
thepossibility of using their mother tongue in education and training,
for accessing information or in their daily work situation tend to be
and/or feel disadvantaged. In most cases the limitation of the use of
a language to areas such as culture and folklore results in the - almost
irreversible - loss of its applicability in professional communication.
On the one hand, the 'linguistic map' of Europe reveals the richness of
the European cultural heritage, but it should not, on the other hand,
be a source for complacency, if we consider the potential for conflict.
It definitely needs a framework of measures at European, national and
language community levels in order to prevent the smaller language communities
from dropping into a seriously disadvantaged situation. The planned European Network of Terminology Documentation Centres (TDCnet) will be a cornerstone of the future terminology infrastructures in Europe. It will also support the further development of the terminology market by providing information on existing terminology resources, activities, experiences, services, etc., and on the conditions of their availability. Wherever and whenever specialized information and specialized knowledge are being prepared, represented, processed, transformed and transferred, terminology is accorded a crucial role. There is, therefore, hardly any area in an enterprise or other specialized organization, where terminology is not used.
1 Lack of awareness and fragmented situation On the one hand, terminology is of fundamental importance as it represents specialized knowledge at the level of concepts (which are the basic units of subject-field related thinking/cognition, knowledge and communication) ; this means, that terminological data are the main 'content carriers' to record, order, store, manage, represent, retrieve, disseminate, communicate or transfer specialized information and knowledge. On the other hand, there is comparatively little awareness of this importance even in the quarters of the specialists who are the primary creators and users of the terminology of their respective suject fields. This is partly due to the fact that terminologies are created as a rule by domain experts of various levels in a multitude of suject fields in an 'evolutionary' rather than coordinated way. This results in a highly fragmented and sectorized situation with respect to most terminological activities and applications. The emergence of a terminology market for terminological products and services will certainly improve this situation, but it also needs terminology infrastructures to support the creation and distribution, re-use and use of terminologies - especially in multilingual and multicultural Europe. 1.1 Economic and social impact of mother-tongue use There seems to exist a vexing relationship between the possibility of using one's mother tongue and the well-being of the respective language community. People whose mother tongue is not (or not sufficiently) developed from the point of view of specialized language or who are denied the possibility of using their mother tongue in education and training, for accessing information or on their daily work place tend to be and/or feel disadvantaged. Smaller language communities in particular, including linguistic minorities of all sorts, have to balance many linguistic disadvantages by making more efforts than the surrounding larger language communities in order to prevent marginalization with respect to scientific-technical and economic-industrial development - which may ultimately lead to socio-economic decline. In most cases the limitation of the use of a language to areas such as culture and folklore results in the - almost irreversible - loss of its applicability in professional communication. This calls for a distinct conciousness of the need for terminology planning in many/all language communities, as well as concrete legal and administrative action to support it. 1.2 Situation of specialized languages in Europe In today's European Union (EU), there are only a few language communities whose mother tongue is openly discriminated against or even suppressed, but there are many whose language situation can be described as disadvantaged for various reasons. An unorthodox analysis of the 'linguistic map' of Europe shows that from the point of view of language variety, Europe must be considered rather 'poor' compared to other continents. Nevertheless the language distribution is far from being simple and problem-free. According to recent figures, about 55-60 languages are used as a mother tongue by language communities of more than 50,000 speakers (this includes non-European languages used by foreign workers or refugees, but does not take into account the Caucasus region, which is a linguistic cosmos of its own). If only languages with more than 500,000 mother-tongue speakers are considered, the figure drops to about 45 languages. Some of these language communities do not much care about LSPs (languages for special purposes), nor do a few of the language communities of having less than 500,000 speakers who are undertaking serious efforts to develop their language as a tool of modern communication. In the EU more than 260m citizens use one of the four major languages which each have more than 50m mother-tongue speakers: German [90m], English [61m], French [58m], Italian [55m]. Some 80m people use as a mother tongue one of the other 7 official working languages of the EU institutions: Spanish [25m - discounting Basque, Catalan and Galician speakers], Dutch [21m], Portuguese [10m], Greek [10m], Swedish [9m], Danish [4m], Finnish [4m]. An additional 20m EU citizens use one of more than 10 other languages which have more than 50,000 speakers. Together there are more than 30 officially recognized language communities in the EU, a further 10 are seeking official recognition, not to mention all sorts of 'minority languages'. The ratio of small to large language communities, therefore, is about 35% (this is if Spanish and Portuguese are counted among the small language communities - which can of course be argued). In this context, it is also difficult to evaluate the situation of people who speaking a language which is considered to be a minority language in their immediate area, but which is a majority language elsewhere in Europe or in the world. Each further extension of the EU will push the above-mentioned ratio towards 50% and eventually higher, which would mean that sometime in the future, more than half of the EU population might belong to a potentially disadvantaged language community. Various references consulted with regard to language statistics are by no means consistent. Therefore, individual figures may be questioned, which, however, has no substantial effect on the overall picture presented here. The figures, on the one hand, represent the richness of the European cultural heritage, but should not, on the other hand, be a source of complacency, if we consider the potential for conflict. It definitely needs a framework of measures at European, national and language community levels in order to prevent the smaller language communities from dropping into a seriously disadvantaged situation.
2 The ‘Terminology Market’ Not in the least due to the European Commission's emphasis on multilingual aspects in all Community R&D Programmes, a terminology market - as it rightly deserves to be called - is gradually emerging. 2.1 Terminology products and services for whom? Terminologies emerge in the following, among others:
They are created primarily by domain experts of various levels in a multitude of suject fields in an 'evolutionary' rather than a coordinated way. The expert communities, who comprise the primary creators and users of their domain specific terminologies, thus also cause well-known communication problems, such as homonymy and synonymy, which some of them try to resolve by means of descriptive or prescriptive terminology work. Terminology work, therefore, is carried out in a large number of subject fields, usually by groups of experts. In addition, it should be remembered that it is a time-honoured scientific tradition to define what one is talking about in scientific and technical literature (a general rule for instance in standardization) - a good tradition often neglected today in scientific discourse. Since science and technology increasingly influence all walks of life and society, deficient terminologies are not only causing communication difficulties among the respective peer groups, but also have negative repercussions on many people who have to use specialized terminology in the work place, as consumers, as citizens, and more and more, even in intra-family communication. Potentially and increasingly everybody is or could become a more or less frequent user of some specialized terminology in his/her life. The gradually emerging 'terminology market' will offer terminological products and services - which are in fact a particular family of information products and services - to a variety of consumers and clients, such as terminology creators (e.g. researchers, technicians, administrators, etc.), terminology data producers (e.g. terminology database creators, specialized lexicographers, etc.), terminology data distributors (e.g. dictionary publishers, online information services, etc.), and terminology users in general. Terminology creators, data producers and data distributors in most or many cases are also or can become re-users of existing terminological data. 2.2 Terminology products Terminology products mainly comprise:
Terminological information (if terminology documentation is included) consists of three distinct fundamental types of data:
Each of them requires a different type of database system (comprising a set of distinct databases or one integrated database system for different data models). A comprehensive terminology information and documentation centre like Infoterm has to deal with all three types of database systems modelled on the basis of well-defined data categories (according to the 'objects') for different purposes. The data as well as the respective software can be used as 'products' and as a basis for a variety of 'services'. The total volume of the above-mentioned types and categories of data may be estimated as follows:
Terminological data proper Terminological data from the formal point of view represent specialized knowledge at the level of concepts. They do not comprise only terms and definitions and similar types of linguistic symbols, since subject-field related concepts can be represented by any kind of symbol. Terminological data can be offered
In palm-top computers or in even smaller pocket-sized electronic dictionaries, the terminological data may be integrated with the respective software or even hardware. Terminological data can be acquired by customers on the terminology market for internal use only or for re-use, in the course of terminology data interchange, etc., on a variety of different data carriers (diskette, CD-ROM, etc.). However, different user groups need terminological data of different degrees of complexity and granularity for different purposes. It is therefore highly economical to prepare multi-purpose terminological data for different purposes and users, whose needs are taken care of by appropriately tailored customer-specific user-interfaces. Terminological data can also be used very efficiently as the intellectual 'skeleton' (or infrastructure) around which the contents of domain-specific encyclopedias can be organized. Terminological tools Terminology application software provides the most common tools for the handling of terminological data in a variety of ways. Terminology management systems (TMSs) are designed as dedicated tools to record, store, process and output terminological data in a professional manner. There are different kinds of TMSs for different purposes. Terminology databases consist of terminological data and a TMS to handle these data. Terminology data banks (TDBs) are more or less sophisticated organizational/institutional structures established for the handling and maintenance of large amounts of terminological data for a multitude of users with the help of a TMS. TDBs can consist of several or many terminology databases. TDBs are supported by a TMS often running on a mainframe, mini-computer or work-station, whereas most of the PC-based TMSs today are employed by individual users, small cooperatives (sometimes integrated by a LAN), or larger departments (where the individual workstations are usually linked by a more or less sophisticated LAN). On the one hand TMSs are increasingly evolving into tools for various applications, such as:
On the other hand TMS modules of varying degree of sophistication are implemented into all kinds of application software. They are thus increasingly applied in a variety of information and communication workflows. In the future, appropriately designed TMSs or TMS modules will find new markets particularly in applications, such as:
If appropriate tools were available for computer-assisted cooperative (and network-based distributed) terminology work, the preparation, processing and maintenance of terminological data could be carried out faster, more efficiently, and in line with modern quality management approaches. Needless to say, this would help the terminology market to develop considerably. There is a tendency in software development to try to integrate TMSs into all kind of information and work flows thus turning them into management tools. 2.3 Terminology services At present the following terminology services already exist or are foreseeable in the near future:
Consultancy services and training Consultancy services and training are most often needed in conjunction with application aspects, such as:
As a rule today's experts have not studied the basic theory of logic and epistemology underlying the science of sciences (or science theory - which also comprises the basic theory of terminology). They therefore often need training in the theoretical and methodological basics of terminology science and terminography. Large organizations/institutions often need to integrate terminological methods and tools into their information management or quality management schemes. Government agencies and other public authorities in many countries want to implement knowledge transfer policies, which would benefit greatly from the appropriate terminology planning methods. Institutions and organizations also frequently need advice with respect to legal problems (especially related to intellectual property rights) concerning the application of terminologcal data and tools. Terminology consultancy, therefore, should become an integral part of many types of consultancy services, especially business consulting, communication consulting, etc. It will become indispensable in future e-commerce, e.g. for the preparation of online product catalogues, interchange of data on technical devices, etc. As knowledge management cannot do without terminology, information and knowledge management also need a strong terminology component. It has
to be mentioned, however, that with a few exceptions (e.g. China, Greece,
etc.) these needs have not been recognized because decision makers are
not aware of the usefulness and effectiveness of such services. Therefore,
it is still a dormant market owing to lack of interest and investment. Outsourcing Increasingly, various types of institutions and organisations are considering outsourcing as a suitable method for coping with identified limited terminological needs. Outsourcing may refer, for instance, to:
Information services Increasingly, terminological products and services will be available as one or attached to one of many kinds of information services available on the market. This is similar to the general situation in the field of information and communication technology (ICT). They will also be increasingly integrated into other ICT applications. For the distribution of terminological data to different user groups with various user needs, efforts should be made to establish market-oriented and fee-based information networks for providing terminological data proper, as well as value-added terminological products and services, on a commercial basis. The clients will thus have to pay for terminological products and services. The greater the possibilities for allowing clients to chose from an ever increasing variety of terminological products and services, the more affordable they will become. High-end and low-end services in the field of terminology As a rule, terminologists working on the preparation or revision of terminological data (especially those used to support linguistic applications, such as translations, localization, technical writing, etc.) are situated at the lower end of the job ladder. Those who can use sophisticated tools and have access to a variety of information sources can 'sell' their services at a much higher rate. If they know a variety of tools and applications well, their services may be highly valuable for companies and institutions that want to improve their efficiency. This translates into high service rates similar to those providing training in the use of methods and tools. Terminology consultants who can advise industry and public institutions on the most efficiant application of terminology and language engineering tools will certainly find themselves at the high end of the business ladder. The future belongs to those who can demonstrate the efficiency increase by integrating terminological methods and tools into any specialized information-, communication- and knowledge-related application, particularly if it is multilingual.
3 The terminology infrastructures in Europe Given the number of terminological entries that need to be prepared in a multitude of languages in science and technology and other suject fields, this monumental task cannot be performed without the help of millions of experts who need to do this regardless, if they want to work and communicate efficiently and effectively. In most cases today such terminology work is carried out in the form of thousands of small cooperative efforts scattered all accross the globe and in many suject fields with little inter-connection. It is performed as a rule in a non-commercial (let alone non-profit) framework. In some cases terminological activities are carried out 'horizontally', i.e. at the language level across many or all suject fields. In many or most cases, however, they are carried out 'vertically', i.e. within a given subject field or subfield. In smaller language communities (or even larger language communities that feel 'threatened' for some reason or other) the proportion of horizontal terminological activities/efforts will probably be bigger than in larger language communities with many developed specialized languages. Since the beginning of the 1980s, the emergence of new terminological infrastructures has been speeding up. At the same time, the networking at transnational, regional and international level has also been improving. Horizontal terminology infrastructure(s) In every language community, it requires a public or semi-public infrastructure to:
The future horizontal terminology infrastructure is composed of five main structural elements or aspects:
Often two or more of these elements/aspects can or will be combined, and in many cases they are or should be institutionalized in order to be effective. Vertical terminology infrastructure(s) In contrast to the impression one may get when visiting bookstores or libraries, most terminological data are not prepared by terminologists or lexicographers, but by subject specialists, normally in teams, with the primary aim of facilitating communication within the respective subject field. In every larger language community, several hundred or even several thousand scientific/technical institutions, learned societies or professional associations organize the preparation of terminology. In the field of standardization alone there often more than 100 working groups of technical committees standardizing or harmonizing the terminology necessary for enhancing the coherence and consistency of the standards' texts and for facilitating understanding. Most of these efforts are carried out without knowing about similar projects, lacking basic knowledge of terminological principles and methods, and without using appropriate not to mention state-of-the-art tools. The degree of (formal or informal) authoritativeness may vary significantly - sometimes even within an organization. World-wide there are probably more than 50,000 terminology working groups or committees actively preparing terminologies at any given time. By far most of terminology work is carried out as a collective work by subject-field specialists under the umbrella of a more or less 'authoritative' organization or institution. Sometimes the terminology contained in technical rules/regulations at national level is also considered as quasi-standardized terminology. Harmonized/standardized terminologies are issued by an official public or officially authorized harmonization/standardization body. Often the documents containing such terminologies are referred to in laws, so that the terminology becomes 'legalized'. Quasi-standardized terminologies are prepareds by subject-field authorities recognized in the respective field (e.g. IUPAC) or by an institution/organization authorized for this purpose, but not belonging to the official standardization framework. Other kinds of 'authoritative' terminologies are at least issued by or published under the patronage of a (formally or informally recognized) subject-field authority. The authoritative nature of data (viz. the degree of authoritativeness) depends on the status of the data originator as one of:
and on whether the data is:
as well as on whether it:
As a rule there is no absolute 'authority' covering all applications, the authority in most cases is restricted to an implicitly or explicitly defined scope, but can often be extended towards similar/neighbouring applications. Sometimes non-authoritative terminology being prepared by one or more individual experts on behalf of an issuing institution/organisation (e.g. publisher) may also acquire the reputation of being 'authoritative'. Terminology standardization Terminology standardization covers two distinct aspects, which belong to two different infrastructures. The standardization of terminological principles and methods (led by the Technical Committee ISO/TC 37 "Terminology (principles and co-ordination)" of the International Organization for Standardization - ISO) certainly belongs to the horizontal infrastructure, whereas the standardization of terminologies in the various technical committees definitely is an element of the vertical infrastructure. World-wide there are probably more than 5,000 terminology working groups in standardization active at any given time. Special/diagonal terminology infrastructure(s) Special terminological efforts are undertaken in the fields of intellectual property rights, customs, air traffic control, etc. They relate to a multitude of subject fields. Some of these are undertaken with a high degree of authoritativeness. Legal (or quasi-legal) terminologies are, for instance, determined by legislation or jurisdiction at international, European or national levels.
4 TDCnet - "European Network of Terminology Information and Documentation Centres" Given the fact that linguistic diversity is an essential component of the national and regional diversity of the cultures of the member states, the European Commission conceived the EU Programme for a "Multilingual Information Society" (MLIS). Businesses and citizens, whatever their language, should enjoy equal opportunities for participation in the new information age. They therefore need multilingual facilities for creating, exchanging and accessing information. The MLIS Programme represents a concentrated effort to speed up the process of getting new language processing technology onto the market. One of the three MLIS action lines is concerned with the construction of infrastructures for European language resources: such as dictionaries, terminology databases, grammars, speech data. The European Project "European Network of Terminology Information and Documentation Centres" (TDCnet) is meant to be a vehicle for establishing the terminology infrastructure in support of the multilingual information society in Europe. Ten partners of the TDCnet Consortium have taken the initiative and established the network, which is open for other TDCs from the European Economic Area to join. 4.1 Domain covered by the TDCnet Project This project aims at creating a 'virtual terminology directory' in the form of a logical and physical network of the official or officially recognized terminology information and docu-mentation centres (TDCs) in Europe. The active networking will comprise:
Thus duplication of efforts will be reduced on the one hand, while on the other hand, a higher degree of overall completeness of information by means of the TDCnet will be achieved. The implementation of the network will be accompanied by the development of new products (such as CD-ROMs etc.) and services (e.g. document delivery services etc.). A promotion strategy will be implemented in order to make new potential user groups aware of the information available. 4.2 The market for the TDCnet In the past the academics used to searching for information via secondary and factual reference data were the main users of existing reference data in the field of terminology. Users (such as translators) and re-users (such as terminology software developers also offering terminology data) still proceed like 'hunters and gatherers' to whom systematic information - not to speak of complete information - is not easily accessible. In industry, where terminology work often has to be performed under great pressure in terms of time and capacities, the present situation leads to the duplication of efforts due to a relative lack of reference information easily accessible online or in the form of bibliographies and directories and to high expenses due to the lack of information on methodology, existing training opportunities, existing terminology services, etc. In this regard, some language communities are highly disadvantaged/under-privileged (especially those that have to undertake special efforts to create 'terminologies' in their own language in order to develop the language into a modern tool of communication). Others enjoy better access to existing information sources due to the existence of well staffed and equipped TDCs and to the activities of pertinent terminology organisations. The systematic approach to providing reference data to users largely on the basis of existing databases (which in most TDCs at present are primarily used as in-house information tools) and to promoting this effort widely necessitates the development of new information products and services in order to cope with foreseeable information bottlenecks. TDCnet will provide everyone in all language communities with an equal opportunity to access existing reference information wherever in Europe it may come from. 4.3 Technology The TDCnet's extranet on the Internet serves as the technical basis for linking existing distributed heterogeneous databases and for organizing the data flow and communication between the collaborating TDCs. The procedures for linking existing databases (via interfaces etc.) and for the interactive co-operation of TDCnet partners (by means of dynamic worksheets, etc.) have to be developed. In the course of this development, some programming efforts are necessary to solve character set problems, data security questions, etc. Some of the data from the virtual reference directory can and should be made available on CD-ROMs or other appropriate information carriers for use at the workplace of individual users or in LANs. A European Terminology Information Server (ETIS) representing among other things a first level access to secondary (viz. bibliographic) and factual (i.e. on activities, institutions, etc.) information in the field of terminology will provide:
Some of these are recorded and offered in as full a form as possible, while others are a representative selection from large distributed data collections. From ETIS the user can be switched through to the distributed databases of the project partners for either more detailed information or for additional information depending on the type of information required. Further TDCs and other collaboration partners beyond the scope of the TDCnet will have the opportunity to provide pertinent data via ETIS in the future. 4.4 Organisation of the Project In the first phase the TDCnet Project Partners have:
In the second phase (equivalent to the second project year) this prototype will be:
The TDCnet will, during the second phase, be extended to all existing official TDCs in Europe. Later - after completion of the project - other documentation centres and further collaboration partners can join or be linked to the TDCnet. There are plans for establishing user clubs (e.g. with European Union institutions and organizations, international organizations of the United Nations system (Joint Inter-Agency Meeting of Computer-Assisted Terminology and Translation), and other pertinent organizations (such as the European Association for Terminology - EAFT), and possibly a selected number of organizations in Central and Eastern European countries) in order to ensure user feedback as early as possible. 4.5 Strategic impact The TDCnet project represents an indispensable building block of the future terminology infrastructure in Europe, because it establishes a long-term framework for systematic co-operation, in the first instance between the existing official TDCs and their wealth of holdings, data collections and experience accumulated over the years, thus creating synergies at every individual site. The physical and logical network enables the development of new (traditional and value-added) information products and services. The TDCnet will be instrumental in helping to establish TDCs where they are needed, but do not yet exist. The TDCnet will provide the users (traditional users as well as new user groups and re-users) with all existing secondary and factual data in the field of terminology and the reference information necessary in order to find out:
4.6 Dissemination The activities within the framework of the project will lead to a higher degree of completeness (of individual records as well as of entire databases) and availability at every site and overall in the network. Dissemination to potential users will be facilitated, by providing efficient access to data. First level information is provided via ETIS (European Terminology Information Server). Some of the reference information will be made available on a commercial (or at least semi-commercial) basis both online and on data carriers (e.g. CD-ROMs) after the project has established the TDCnet on a permanent basis.
5 Outlook The terminology infrastructures as well as the terminology market(s) are still characterized by the co-existence of many loosely interconnected elements. But gradually the mosaic of these elements is becoming more complete, while at the same time turning into a dense network of interactive structures and activities. Cooperation in terminology, which started at an international level, has now become organized at a transnational level within the framework of some language families in Europe. Within the various language communities, however, there are very different language situations with respect to the evolution of terminology infrastructures and the terminology market. Nevertheless a certain pattern seems to be evolving - as was described above. The development of the terminology market and the development of an infrastructure mutually support each other. Some tasks/activities, such as the collection and 'housekeeping' of information, which are in the public interest, must continue to be funded by the community, whereas others increasingly are (and should be) financed by the users, especially those from the private sector. Obviously development has been speeding up recently, but there is still a long way to go. Access to information in the field of terminology is still not as easy for the user as would be desirable. Cooperation among the 'players' in the field still needs promotion and support. Quality of information and services has to be enhanced with a view to user needs - which also requires a higher concern for multifunctional data. The teaching and training situation is still characterized by many 'missing links'. The "European Network of Terminology Information and Documentation Centres" (TDCnet) will be a cornerstone of the future terminology infrastructure in Europe. It will also support further development of the terminology market by providing information on existing terminology resources, activities, experiences, services etc., and on the conditions of their availability.
References: BUDIN, G. Language planning and terminology planning. Theories and practical strategies. Wien: Infoterm, 1993 (Infoterm Document 06-93en) GALINSKI, C. The terminology market. Wien: Infoterm, 1995 (Infoterm Document 23-95en) GALINSKI, C. Terminologie und eine gemeinsame Kultur in Europa. Wien: Infoterm, 1994 (Infoterm Document 21-94de) GALINSKI, C. Terminologisches Informationsmanagement in Harmonisierungs-projekten der EU. Wien: Infoterm, 1994 (Infoterm Document 20-94de) GALINSKI, C. Fachsprachen- und Terminologiepolitik in Europa. In: GRINSTED, A.; MADSEN, B.N. [eds.]. Festskrift til Gert Engel i anledning af hans 70 års fødselsdag. Frederiksberg: Samfundslitteratur, 1994, p.248-265 (Infoterm Document 14-94de) GALINSKI, C. A systematic approach to knowledge transfer. Austrian contribution. Trade and Development Board Ad Hoc Working Group on Interrelationship between Investment and Technology Transfer. Third Session, Geneva, 21 March 1994. Wien: Infoterm, 1994 (Infoterm Document 12-94en) GALINSKI, C. Economic and social impact of mother tongue use in specialized communication in Europe. Wien: Infoterm, 1994 (Infoterm Document 05-94en) GALINSKI, C.; LERAT, P. French language planning in the French speaking world. TermNet News 14 (1993), no. 42/43, p.11-18 GALINSKI, C. e.a. International cooperation in terminology today and tomorrow. Wien: Infoterm, 1993 (Infoterm Document 12-93en) INFOTERM [ed.]. Guidelines for a national terminology planning policy. Wien: INFOTERM, 1989 (Infoterm Document 12-89en)
|
||