The structure of the DCS

The data made available by the DCS are stored in a relational MySQL database. This database is a heavily simplified version of the SanskritTagger database (details).

The DCS database consists of three main modules:

  1. The dictionary stores lexical units, their meanings and verbal forms.
  2. The text collection assembles lemmatized Sanskrit texts.
  3. The dictionary and the text collection are connected by a list of occurrence pointers, which is the most important difference from other digital resources of Sanskrit. For each separable string contained in a text, this list records the lexical units that constitute the string. The DCS therefore makes it possible to search for lemmata rather than for inflected forms or parts of them. This linguistic disambiguation is a clear advantage in philological research.
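
The interplay of the three modules can be illustrated with a minimal sketch. It is not the actual DCS schema: the table names (lexical_unit, text_string, occurrence), their columns, the sample rows and the helper functions are all hypothetical, and an in-memory SQLite database stands in for MySQL so that the example runs on its own. It only shows how occurrence pointers make a search by lemma return the strings that contain it.

```python
import sqlite3


def build_demo_db():
    """Create a toy three-module database (hypothetical schema, not the DCS one)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        -- Module 1: dictionary of lexical units
        CREATE TABLE lexical_unit (
            id INTEGER PRIMARY KEY,
            lemma TEXT NOT NULL,
            meaning TEXT
        );
        -- Module 2: text collection, one row per separable string
        CREATE TABLE text_string (
            id INTEGER PRIMARY KEY,
            text_name TEXT NOT NULL,
            surface TEXT NOT NULL
        );
        -- Module 3: occurrence pointers linking strings to lexical units
        CREATE TABLE occurrence (
            string_id INTEGER NOT NULL REFERENCES text_string(id),
            position INTEGER NOT NULL,
            lexical_unit_id INTEGER NOT NULL REFERENCES lexical_unit(id)
        );
        """
    )
    conn.executemany(
        "INSERT INTO lexical_unit VALUES (?, ?, ?)",
        [(1, "tatra", "there"), (2, "eva", "just, indeed"),
         (3, "na", "not"), (4, "gam", "to go")],
    )
    conn.executemany(
        "INSERT INTO text_string VALUES (?, ?, ?)",
        [(1, "demo text A", "tatraiva"),   # tatra + eva
         (2, "demo text A", "gacchati"),   # inflected form of gam
         (3, "demo text B", "naiva")],     # na + eva
    )
    conn.executemany(
        "INSERT INTO occurrence VALUES (?, ?, ?)",
        [(1, 1, 1), (1, 2, 2), (2, 1, 4), (3, 1, 3), (3, 2, 2)],
    )
    return conn


def find_strings_by_lemma(conn, lemma):
    """Return the surface strings whose occurrence pointers include the lemma."""
    rows = conn.execute(
        """
        SELECT DISTINCT s.id, s.surface
        FROM text_string AS s
        JOIN occurrence AS o ON o.string_id = s.id
        JOIN lexical_unit AS l ON l.id = o.lexical_unit_id
        WHERE l.lemma = ?
        ORDER BY s.id
        """,
        (lemma,),
    ).fetchall()
    return [surface for _, surface in rows]


if __name__ == "__main__":
    demo = build_demo_db()
    print(find_strings_by_lemma(demo, "eva"))
    print(find_strings_by_lemma(demo, "gam"))
```

In this toy data, a search for the lemma eva finds both tatraiva and naiva, although neither string contains eva as a literal substring, and a search for gam finds the inflected form gacchati. A plain substring search over the text would miss all three.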

The structure of the DCS database can be sketched as follows: