Indoor topological slam using frontal computer vision

Boal Martín-Larrauri, Jaime

dc.contributor.advisor	Sánchez Miralles, Álvaro
dc.contributor.author	Boal Martín-Larrauri, Jaime
dc.contributor.other	Universidad Pontificia Comillas, Instituto de Investigación Tecnológica (IIT)	es_ES
dc.date.accessioned	2020-07-14T08:43:00Z
dc.date.available	2020-07-14T08:43:00Z
dc.date.issued	2014
dc.identifier.uri	http://hdl.handle.net/11531/48035
dc.description	Programa de Doctorado en Modelado de Sistemas de Ingeniería	es_ES
dc.description.abstract	En los últimos años se ha producido un gran avance hacia el desarrollo de robots móviles autónomos y es sólo cuestión de tiempo que se conviertan en un elemento habitual de nuestras vidas. Sin embargo, hay que abordar algunos problemas fundamentales antes de que un robot pueda desempeñar cualquier tarea de alto nivel. Uno de estos retos es el de dotar al robot con la habilidad de localizarse en un entorno previamente inexplorado, sin tener que proporcionarle el mapa de la zona con antelación. Así pues, cuando se coloque al robot en una nueva área, debería ser capaz de construir incrementalmente un mapa y determinar su posición dentro del mismo. A este problema se le denomina localización y mapeado simultáneos (SLAM, por sus siglas en inglés). El mapa puede ser un modelo métrico muy preciso o, por el contrario, seguir una estrategia topológica que recuerda a la forma intuitiva en que los seres humanos representamos el espacio. Tenemos una noción abstracta de la distancia pero aun así somos capaces de determinar dónde estamos usando la visión para identificar lugares distintos y la relación que existe entre ellos. Por consiguiente, si no necesitamos responder a la pregunta “¿Dónde estoy?” con precisión de milímetros y grados, ¿por qué debería un robot? Más aún, en aquellas aplicaciones en las que el robot y los seres humanos tengan que interactuar, el mapa debería ser idealmente un medio de comunicación común, por lo que cuanto más se parezca a la forma en que estructuramos la información espacial, más fácil debería ser dicha interacción. Esta tesis propone una solución al problema de la localización y mapeado simultáneos inspirado por el comportamiento humano. Se utiliza una cámara frontal como único sensor para hacer el sistema compatible con un amplio abanico de plataformas robóticas. Por medio de visión artificial, el robot extrae un conjunto de características complementarias (bordes verticales, información de color y puntos característicos) que se centran tanto en aspectos generales de la escena como en los detalles, y emplea un nuevo procedimiento para determinar la correspondencia entre características que se asienta en conceptos tomados del campo del procesamiento de lenguajes naturales. Las citadas características se utilizan para identificar automáticamente lugares susceptibles de ser considerados una ubicacion en el mapa, mediante un algoritmo de segmentación basado en la conectividad algebraica de un grafo. Cada vez que el robot llega a un nuevo lugar, se emplea una formulación bayesiana para decidir si la ubicación es nueva o se encuentra en una ya conocida, con el fin de actualizar el mapa topológico en consecuencia. Como evaluar continuamente todas las posibles combinaciones es computacionalmente inviable, se usa un filtro de partículas para tener en cuenta sólo aquellas topologías más probables.	es_ES
dc.description.abstract	The last few years have seen a great leap forward towards autonomous mobile robots, and it is just a matter of time that they become a regular part of our lives. However, some fundamental problems need to be addressed before a robot can be assigned to any particular high-level application. One of these challenges is to provide the robot with the ability of localizing itself in a previously unvisited environment without having to supply it with a map of the area in advance. Therefore, when the robot is moved to a new area, it should incrementally build a map on its own and determine its position within it. This is known as the Simultaneous Localization And Mapping (SLAM) problem. The map can be a very precise metric model or, alternatively, follow a topological approach that resembles human beings’ more intuitive representation of space. We have an abstract notion of distance but are still able to determine where we are using vision to identify distinct places and the transitions among them. Hence, if we do not need to answer the question “Where am I?” with precision of millimeters and degrees, why should a robot? Moreover, in those applications in which a robot and human beings need to interact, the map should ideally be a common communication framework, so the more similar the map is to the way we structure spatial information, the easier interaction would be. This thesis proposes a relatively computationally inexpensive solution to the SLAM problem inspired by human behavior. A forward-facing camera is the only sensor employed to make the system easily portable to a wide range of robotic platforms. By means of computer vision, the robot extracts a complementary collection of cues (vertical edges, color information, and keypoints) that focus both on the general characteristics of the scene and on the details, and employs a novel matching procedure that builds on concepts borrowed from the natural language processing field. These features are then used to automatically identify qualitatively different locations that are susceptible of being considered a place in the map using an online topological segmentation algorithm based on the algebraic connectivity of graphs. Every time the robot arrives at a place, a Bayesian formulation is employed to decide if the robot is in a new or an already known location in order to update the topological map accordingly. As keeping track of all possibilities over time is computationally intractable, a particle filter is used to take only the most probable topologies into account.	es_ES
dc.format.mimetype	application/pdf	es_ES
dc.language.iso	en	es_ES
dc.rights	Attribution-NonCommercial-NoDerivs 3.0 United States	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/us/	*
dc.subject	33 Ciencias tecnológicas	es_ES
dc.subject	3304 Tecnología de los ordenadores	es_ES
dc.subject	330417 Sistemas en tiempo real	es_ES
dc.subject	22 Física	es_ES
dc.subject	2209 Óptica	es_ES
dc.subject	220990 Tratamiento digital. Imágenes	es_ES
dc.title	Indoor topological slam using frontal computer vision	es_ES
dc.type	info:eu-repo/semantics/doctoralThesis	es_ES
dc.rights.accessRights	info:eu-repo/semantics/openAccess	es_ES
dc.keywords	Visión artificial, Detección y correspondencia de características, Robots móviles, SLAM, Modelado topológico del entorno	es_ES
dc.keywords	Computer vision, Feature detection and matching, Mobile robots, SLAM, Topological modeling of the environment	es_ES