+1 917 8105386 [email protected]

A Formal Description of Arabic Syntax in Definite Clause Grammar

1. formulate a problem 2. provide some examples 3. number these examples and when referring to them 4. draw syntactic trees of these examples 5. explain the process and how it might be complicate (refer to Chomsky or other sources) 6. it doesn't have to provide a solution 7. don't make it as a book report. A Formal Description of Arabic Syntax in Definite Clause Grammar Hisham E1-SllisMny IBM Scientific Center, 56, Gameaat El-Doual EI-Arabeya St. Mohandesseen, Cairo, Egypt Abstract Arabic has some special syntax features which lead to complex syntax structures. We have developed a formal description of Arabic syntax in Definite Clause Grammar. This grarnrnar is characterized by its high descriptive power due to its dual formulation in terms of functions and in terms of grammatical categories. The developed grammar has a high coverage of Arabic language and has context sensitive capabilities. It is suitable for the advanced applications of natural language processing. 1o Introduction Arabic has some features which lead to comlpex syntax different from those of the European languages. Moreover, Arabic grammar exists cmly in a descriptive form and there is no comprehensive formal rep~ resentation for it so far. The few attempts to give a formal mode~ for Arabic sentences were based on transformational generative grammar - [111, [2], [3] and others-but some linguists adopted more recent linguistic models such as Lexical Functional Grammar [4], dependency grammar [5] and functional grammar 16]. Selecting a suitable grammar formalism for Arabic and the formulation of the grammar itself is the subject of a big debate in the Arab countries nowadays. Our descriptio:n of Arabic syntax in Definite Clause Grammar is different from the attempts undertaken so far. 2. Deft,ire clat,se gram,,ar for Arabic syntax In this grammar, the arguments of non terminals are used to hold special features of Arabic words such as the definitely or indefinitely determined feature of norms which is essential to recognize many structures. Also, the ability of Definite Clause Grammar to provide a general treatment of context sensitivity through the proper use of arguments of non terminals makes it easy to account for agreemeats such as number, gender and person agreements. Another important characteristic of l)efinite Clause Grammar, which suits Arabic, is the possibility of imposing extra conditions on the constituents of a phrase which must be satisfied for a rule to be valid [7]. 3. The develol,ed /ormal g,'ammar Due to the fact that there is IIO single basic word order for Arabic sentences, three basic sentence types were defined: a. Nominal sentence: a sentence that does not contain a verb or contains a verb which follows the subject. b. Verbal sentence: a sentence that contains a verb which precedes the subject. c. Sentences with special structures such as vocative sentences. Sentences are further classified functionally and according to their modalities. Larger sentences are also described in this grammar by conjoining sentences or embedding simpler sentences. We have used the grammatical categories together with the functional roles to define the syntactic structures. Grammatical categories alone were considered not sufficient to describe all the structures 345 since word categories in Arabic are globally classified into verbs, nominals and particles only. Nouns, adjectives and adverbs for example are differentiated in tt'te sentence according to their functions. This dual formulation increases the descriptive power of the grammar. In order to use this dual formulation to define the syntactic structures, some non standard definitions of grammatical categories were used. A nominal sentence (ns) is defined formally as composed from a 'mobtadaa' phrase, mbp, and a predicate phrase, predp: ns(ns(MBP, PREDP)) -~ mbp (N,G,m, -., MBP), predp (N,G,n,PREDP). A mbp is defined as a noun phrase (np) that can have 'motaalkat' which are either a prepositional phrase (PP) or adverbial. A predp can be one of the following: • a noun phrase that can have 'motaalkat' • a prepositional phrase or adverbial - a sentence which can be : - a verbal sentence - a nominal sentence (under certain conditions). Verbal sentences (vs) can be followed by either a subject phrase, sp, or a subject phrase and a complement phrase, cop: vs(vs(VP))-~ vp(T,V,S,I',VP). vs(vs(VP,SP,COP)) + vp(T,a,S,P,VP), sp(-,-,S,-,P,SP),cop(M,COP),{M=<T} cop(o,cop([ 1)) -~ I 1. For verbs in the passive voice the sp is replaced by 'naabfael' phrase (nsp), which can be either a np or pp. For non-terminal argument definitions the reader is referred to sections 3.1.1., 3.1.2 and 3.1.3. Tlle values of the arguments indicated above are: a for active voice, n for indefinite determination, m for definite determination and - for acceptance of any value. Verb phrases (vp) are defined as follows: vp --~ (cop), (particle), verb, (cop). whereas noun phrases (rip) are defined as follows: np -~ (particle), noutL (postmoditiers). Subject phrases are noun phrases whereas complement phrases modify the verbs and are one or more noun phrases (such as tile direct object) and/or prepositional phrases. The cop category is used to cater for the cases where the sp separates the verb from some of its modifiers, which is a characteristic of Arabic syntax. Sentences with special structures haw,, different structures which vary from one case to anotller. Sp and cop can be relative pllrases and in some cases the predp can precede the mbp. Categories with no equivalent in English are indicated between single quotes and categories inside the brackets are optional. 3.1. Non Terminal mgu,tents Non-terminal arguments are used to hold features of Arabic words necessary for recognizing some structures and for allowing agreements between the different costituents of the sentence. Arguments are introduced in the lexical entries of words and are inherited by the phrase in which the words are constituents, h't this grammar, arguments are defined as fol- ]OWS: 3.1.1. Arguments associated With verbs T transitivity, V voice, P person, S semantic feature. 3.1.2. Arguments associated with nouns N number, G gender, D definite or indefinite determination, I:' person, S semantic feature. 3.1.3. Arguments associated with pronouns N number, G gender, P person. M is an argument associated with senre non-terminals to test the possibility of modifying a verb witll objects. 3.2. Use of semantic features to reduce ambiguity in Arabic senstences. Semantic features art, used with verbs 346 and nouns in order to test the subject verb semantic agreement. The use of the described semantic features in the developed grammar helps in the diifferentiation between the subject and the object in Arabic sentences, since it is not possible on purely grammatical basis (except in some exceptional cases) to differentiate between them. 3.3. Examples In this section, examples of the syntactic structures of a nominal sentence and a verbal sentence according to the developed grammar are given. 3.3.1. Example of a nominal sentence: Which means: the big dog is in the garden. In arabic, the definite article is a prefix, there is no copulative verb and direction of writing is from right to left. SIS /\ pF p 7 p pp np ~ p ad} noun I I I I the garden in big the dog 3.3.2. Example of a verbal sentence: Which means: the boy drinks the water in the house v,~ cop sp .- cop I PP /\ np np np p I r I the water the child the house in v L drinks 4. Conclusiml This grammar has been implemented in a syntactic analyzer [8] developed in Prolog on a 1'C/XT-286 and tested on a large number of Arabic sentences. It is hoped that the developed grammar will contribute to the efforts undertaken recently in the Arab countries towards the development of a comprehensive, agreed upon, formal grammar for Arabic syntax. References [1] Bakir M. 'Aspects of clm,se structure in Arabic: a study of word order variation in literacy Arabic', Ph.d., Indiana University, 1980. [2l AI-Khuli M.'A contrastive transformational grammar: Arabic and English', Leiden: Britl, 1979. }3] Ayoub G.'Structure de la phrase verbale en Arabe standard', Ph.D., in: Analysis/Theories, 1981. 14l Fehri F. 'Complementation ct amophore en Arabe moderne: t,ne approche lexlcale fonctionnelle', These de Doctorat d'Etat, Univcrsitc dc Paris 3, 1981. [5] Owens J.'Structure, (:lass and Dependency: Modern linguistic them y and the Arabic grammatical tradition', in: Lingua 64, 198,1. [6) Mout(makil A. 'Pragmatic functi(ms in a {unctional grammar of Arabic', Dordrecht: Forts Publicatiolls, 1989. [71 Pereira F., Warren D 'Definite clause grammar for language analysis- A survey of the formalism and a comparison with transition networks', Artificial Intelligence, Vol. 13, pp. 231 - 278, 1980. [8] Hisham E1-Shishiny. 'A syntactic analyzer for Arabic sentences', IBM-CSC Technical Report # 32, 1989. 347

Ready To Get Started?

GET STARTED TODAY