ANNa

An analogical proportion (a more precise term for "analogy") is a relation between four elements A, B, C, and D, usually of the same nature. It is a statement of the form "A is to B as C is to D", written "A : B :: C : D". Such a statement means that the relation between A and B is similar to the one between C and D.

The problem of analogical proportion can be split into two key problems, namely analogy detection and analogy resolution. Analogy detection is, as the name indicates, the problem of deciding whether four elements A, B, C, and D are in analogical proportion. Analogy resolution is the problem of finding the missing element of an analogical equation A : B :: C : ?.

These two problems provide a logical framework to address learning, transfer, and explainability concerns. Such a framework finds useful applications in artificial intelligence and natural language processing.

Morphological analogies are analogical proportions between words, using morphological relations. For example, in English, dead : undead :: do : undo is a morphological analogy: undead is dead with the prefix un-, and undo is do with the prefix un-. In other words, dead is to undead as do is to undo.
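To make detection and resolution concrete on the example above, here is a minimal sketch. It is a toy symbolic rule that only handles prefixation, not the ANNa models, and the function names are hypothetical ones chosen for illustration.

```python
def prefix_rule(a, b):
    """Return the prefix p such that b == p + a, or None if there is none."""
    if b.endswith(a):
        return b[:len(b) - len(a)]
    return None


def detect_prefix_analogy(a, b, c, d):
    """Toy detection: is a : b :: c : d explained by one shared prefix?"""
    prefix = prefix_rule(a, b)
    return prefix is not None and d == prefix + c


def resolve_prefix_analogy(a, b, c):
    """Toy resolution: solve a : b :: c : ? by reusing the prefix of a -> b."""
    prefix = prefix_rule(a, b)
    return None if prefix is None else prefix + c


print(detect_prefix_analogy("dead", "undead", "do", "undo"))  # True
print(detect_prefix_analogy("dead", "undead", "do", "redo"))  # False
print(resolve_prefix_analogy("dead", "undead", "do"))         # undo
```

A rule like this fails as soon as the relation is not a plain prefix (suffixes, vowel changes, irregular forms), which is the kind of case a learned approach is meant to handle.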

What are ANNa-MD and ANNa-MR?

ANNa-MD (ANNa for Morphological analogy Detection) and ANNa-MR (ANNa for Morphological analogy Resolution) are deep learning models designed to tackle morphological analogies. They display competitive performance on analogy detection and resolution over 11 languages.

To transform the words into computer-readable elements, we use an embedding model inspired by Kim et al. This embedding model is capable of capturing the morphological features of a word and expressing them as a vector called a word embedding.
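As a rough illustration of the idea of a character-level embedding, here is a minimal sketch in the spirit of Kim et al.: convolution filters slide over one-hot encoded characters and each filter keeps its maximum response over the word. It is not the embedding model used by ANNa. The filters are random and untrained, and the alphabet, filter width, and function names are all assumptions made for the example.

```python
import random

ALPHABET = "abcdefghijklmnopqrstuvwxyz"  # assumed alphabet, for illustration


def one_hot(ch):
    """Encode one character; characters outside ALPHABET become all zeros."""
    return [1.0 if ch == letter else 0.0 for letter in ALPHABET]


def make_filters(num_filters, width, seed=0):
    """Random, untrained filters of shape (width, len(ALPHABET))."""
    rng = random.Random(seed)
    return [
        [[rng.uniform(-1.0, 1.0) for _ in ALPHABET] for _ in range(width)]
        for _ in range(num_filters)
    ]


def embed(word, filters):
    """One value per filter: the filter's best match over all character windows."""
    width = len(filters[0])
    chars = [one_hot(ch) for ch in word.lower()]
    while len(chars) < width:  # pad short words so one window always exists
        chars.append([0.0] * len(ALPHABET))
    embedding = []
    for filt in filters:
        responses = [
            sum(
                filt[k][j] * chars[start + k][j]
                for k in range(width)
                for j in range(len(ALPHABET))
            )
            for start in range(len(chars) - width + 1)
        ]
        embedding.append(max(responses))  # max-over-time pooling
    return embedding


filters = make_filters(num_filters=4, width=2)
print(embed("undo", filters))
```

Because each filter reacts to a short character pattern wherever it occurs in the word, a trained version of such filters can pick up affixes like un-, which is the kind of morphological feature the embedding is meant to capture.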

ANNa-MD relies on a convolutional neural network to detect valid morphological analogies using the above-mentioned embedding model (Alsaidi et al. (a)). The model used is similar to the one by Lim et al. The embedding model and the classifier ANNa-MD are trained together, using data from 11 different languages.

ANNa-MR uses multiple fully-connected layers to resolve a morphological analogical equation, i.e., an analogy with a missing element. Like ANNa-MD, it uses the morphological embedding model we designed. The structure of the model is similar to what is proposed by Lim et al. From the embeddings of three words, ANNa-MR computes the embedding of the last word to complete the analogy. The word returned is the closest existing word to the model output in the embedding space.
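The retrieval step described above can be sketched as follows. This is a minimal sketch under stated assumptions: the 2-D embeddings are hand-picked toy values, the predicted embedding comes from the parallelogram rule (C + B - A) standing in for the ANNa-MR network, and Euclidean distance is an assumed choice of metric. All names are hypothetical.

```python
import math

# Toy 2-D embeddings, hand-picked for illustration (not learned).
VOCAB = {
    "dead": (0.0, 0.0),
    "undead": (1.0, 0.0),
    "do": (0.0, 1.0),
    "undo": (1.0, 1.0),
    "redo": (2.0, 1.0),
}


def predict_embedding(e_a, e_b, e_c):
    """Placeholder for the network: parallelogram rule e_c + e_b - e_a."""
    return tuple(c + b - a for a, b, c in zip(e_a, e_b, e_c))


def closest_word(vector, vocab):
    """Return the vocabulary word whose embedding is nearest to `vector`."""
    return min(vocab, key=lambda word: math.dist(vocab[word], vector))


def resolve_by_retrieval(a, b, c, vocab=VOCAB):
    """Solve a : b :: c : ? by predicting an embedding, then retrieving a word."""
    predicted = predict_embedding(vocab[a], vocab[b], vocab[c])
    return closest_word(predicted, vocab)


print(resolve_by_retrieval("dead", "undead", "do"))  # undo
```

One consequence of retrieving the nearest existing word is that the answer is always drawn from the known vocabulary, so a word absent from it can never be returned.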

A Neural Approach for Detecting Morphological Analogies

S. Alsaidi, A. Decker, P. Lay, E. Marquer, P.-A. Murena, M. Couceiro
DSAA, 2021

Analogical proportions are statements of the form "A is to B as C is to D" that are used for several reasoning and classification tasks in artificial intelligence and natural language processing (NLP). For instance, there are analogy based approaches to semantics as well as to morphology. In fact, symbolic approaches were developed to solve or to detect analogies between character strings, e.g., the axiomatic approach as well as that based on Kolmogorov complexity. In this paper, we propose a deep learning approach to detect morphological analogies, for instance, with reinflexion or conjugation. We present empirical results that show that our framework is competitive with the above-mentioned state of the art symbolic approaches. We also explore empirically its transferability capacity across languages, which highlights interesting similarities between them.

On the Transferability of Neural Models of Morphological Analogies

S. Alsaidi, A. Decker, P. Lay, E. Marquer, P.-A. Murena, M. Couceiro
AIMLAI, 2021

Analogical proportions are statements expressed in the form "A is to B as C is to D" and are used for several reasoning and classification tasks in artificial intelligence and natural language processing (NLP). In this paper, we focus on morphological tasks and we propose a deep learning approach to detect morphological analogies. We present an empirical study to see how our framework transfers across languages, and that highlights interesting similarities and differences between these languages. In view of these results, we also discuss the possibility of building a multilingual morphological model.

Tackling Morphological Analogies Using Deep Learning - Extended Version

S. Alsaidi, A. Decker, P. Lay, E. Marquer, P.-A. Murena, M. Couceiro
arXiv preprint, 2021

Analogical proportions are statements of the form "A is to B as C is to D". They constitute an inference tool that provides a logical framework to address learning, transfer, and explainability concerns and that finds useful applications in artificial intelligence and natural language processing. In this paper, we address two problems, namely, analogy detection and resolution in morphology. Multiple symbolic approaches tackle the problem of analogies in morphology and achieve competitive performance. We show that it is possible to use a data-driven strategy to outperform those models. We propose an approach using deep learning to detect and solve morphological analogies. It encodes structural properties of analogical proportions and relies on a specifically designed embedding model capturing morphological characteristics of words. We demonstrate our model's competitive performance on analogy detection and resolution over multiple languages. We provide an empirical study to analyze the impact of balancing training data and evaluate the robustness of our approach to input perturbation.

Character-Aware Neural Language Models

Yoon Kim, Yacine Jernite, David Sontag, Alexander M. Rush
AAAI, 2016

We describe a simple neural language model that relies only on character-level inputs. Predictions are still made at the word-level. Our model employs a convolutional neural network (CNN) and a highway network over characters, whose output is given to a long short-term memory (LSTM) recurrent neural network language model (RNN-LM). On the English Penn Treebank the model is on par with the existing state-of-the-art despite having 60% fewer parameters. On languages with rich morphology (Arabic, Czech, French, German, Spanish, Russian), the model outperforms word-level/morpheme-level LSTM baselines, again with fewer parameters. The results suggest that on many languages, character inputs are sufficient for language modeling. Analysis of word representations obtained from the character composition part of the model reveals that the model is able to encode, from characters only, both semantic and orthographic information.

Contact

Address

Loria
Campus Scientifique
BP 239
54506 Vandoeuvre-lès-Nancy, France

Mail

cherif-hasssan.nousradine@loria.fr
esteban.marquer@loria.fr
miguel.couceiro@loria.fr

Design: Esteban Marquer
Initial design: Dopetrope by AJ for HTML5 UP
