The Goodwill Consortium for AI research

 

We have joined forces in a collaborative consortium whose aim is to develop the concept of an integrated mathematical model that is able to represent syntactic and semantic aspects of language in a reduced lexicon, namely Basic English, and ultimately develop a general artificial cognition engine.  The fundamental concept lies in creating a mathematical means to create a multi-dimensional graphical representation of a sentence, in a space pre-conditioned by the inter-relationship of all words in this reduced lexicon.  To translate from standard English a wizard is employed, based on a vocabulary of 100,000+ words, to a representation using less than 2,000 words, which is available at www.simplish.org.  Thus, in a multidimensional space the central low-dimensionality points describe words and their relationship, while increasingly complex phrases are represented by further dimensionality points.  The core relationships have been derived from a single individual and then refined; unlike the majority of current efforts which rely on a large corpus of text, which have the disadvantage that ambiguity is a major problem.  A descriptive phrase may either be represented by a complex word (i.e. not one of the basic English lexicon) or it may represent an abstract  concept.  Our method is able to cope with both data-driven requirements and concept-driven information needed for problem-solving. 

Possibly the most important aspect of cognition has to do with memory. We know that the processes of acquisition, storage and retrieval of knowledge lie at the heart of human cognition. Furthermore, it has long been known that organization and memorization are inseparable and that memory is aided by meaning. Therefore, working out a way to establish the meaning of words in an artificial cognition system helps organization and is a crucial step in developing these systems. Our way to achieve this objective is to assign meaning in terms of the other words in a reduced vocabulary, unlike standard natural language where 100,000 words would have to be clustered somehow and related to each other.

 Relating the core 1,000 words of Basic English to each other is being done by another team within our Consortium using multivariate methods, which also provides means to create a multi-dimensional graphical representation of a sentence, in a space pre-conditioned by the inter-relationship of all words in this reduced lexicon.  Thus, these 1,000 words enable unique infinite expressiveness in a relatively low-dimensionality space.  This representation is conceptually similar to Chinese writing with ideograms. In this pre-conditioned space two phrases written using different words that convey similar meaning will be represented by a similar ideogram, which can be compared to existing data, analysed for concepts being searched for and/or extrapolated.  The net result will be to allow us to move from pattern-matching to concept-matching, in a multidimensional nearest-neighbour search.

In this broad strategy, it is possible to assign trajectories to each user, service or specific information in a collaborative network.  Moreover, the direction of these trajectories can be related to a planning path or the collection of the corresponding mission-derived data.  Trajectories may end on a known precursor condition that can be extrapolated to an actionable conclusion, while other users’ threads, which can be allocated degrees of priority/credibility, are used to corroborate or reject said action.  In this multi-threaded space information fusion is achievable and moreover, users can be added or deleted as they come into and out of a theater.

The consortium is currently collaborating on a first approximation to a general cognition system in the form of a conversational agent able to interact in standard English with a vocabulary of almost 100,000 words, endowed with memory, semantic association capabilities and a limited word-algebra capability, available in internet at www.rachaelrepp.org. This conversational agent is also being refined as a joint submission to the Loebner Prize for 2012 (http://loebner.net/Prizef/loebner-prize.html), a competition aimed at finding a computer system able to pass Turing´s test.

Finally, this broad approach can be helpful in the general field of human-machine interaction by enabling humans to express themselves in full-vocabulary natural language at the same time as machines only actually process semantically-filtered limited-vocabulary phrases.

 

Programme Partners:

 

¨     The Goodwill Company Ltd. (U.K.) – Provides overall system design and integration for defence applications.

¨     AVNTK S.C. (Mexico) – Provides IT facilities, computer science and AI expertise.

¨     Ardita Aeronautica (Mexico) – Responsible for adapting the technology to human-machine interaction applications.

¨     Natura Xalli (Mexico) – Responsible for semantics-manipulation, speech processing and adapting the technology to HMI applications.

¨     Universidad de Guadalajara (Mexico) – Provides undergraduate internship students to support IT development.

¨     Universidad Autonoma de Nayarit (Mexico) – Provides undergraduate internship students to support IT development.

¨     Universidad Marista (Mexico) – Provides undergraduate internship students to support IT development.

¨     Instituto Tecnológico de Ciudad Guzmán (Mexico) – Provides undergraduate internship students to support IT development.

 


Consortium Members :

Fiona K. Taylor, M.A., The Goodwill Company Ltd.

Dr. Marcelo Funes-Gallanzi, AVNTK S.C.

Dr. Luis Roberto Sahagún Ortiz, Ardita Aeronautica.

M.C. Eva Ivonne Torres Chávez, AVNTK S.C.

Ing. José Manuel Hernández Cruz, Natura Xalli

Dr. Juan Carlos Zuñiga, Universidad de Guadalajara

M.C. Ma. Rosario Montes Alvarez, Instituto Tecnologico de Cuidad Guzmán

Dr. Alejandro Lima, Universidad Marista de Guadalajara

Dr. Pablo Velarde, Universidad Autónoma de Nayarit

C. Edgar Carrillo, Universidad Autónoma de Nayarit

 

 

UM

Ardita

UDG

 

ITG

GW

UAN

NX