Indoor topological slam using frontal computer vision

Boal Martín-Larrauri, Jaime

View/Open

Tesis Doctoral (2.648Mb)

Autorización (269.5Kb)

Date

2014

Abstract

En los últimos años se ha producido un gran avance hacia el desarrollo de robots móviles autónomos y es sólo cuestión de tiempo que se conviertan en un elemento habitual de nuestras vidas. Sin embargo, hay que abordar algunos problemas fundamentales antes de que un robot pueda desempeñar cualquier tarea de alto nivel. Uno de estos retos es el de dotar al robot con la habilidad de localizarse en un entorno previamente inexplorado, sin tener que proporcionarle el mapa de la zona con antelación. Así pues, cuando se coloque al robot en una nueva área, debería ser capaz de construir incrementalmente un mapa y determinar su posición dentro del mismo. A este problema se le denomina localización y mapeado simultáneos (SLAM, por sus siglas en inglés). El mapa puede ser un modelo métrico muy preciso o, por el contrario, seguir una estrategia topológica que recuerda a la forma intuitiva en que los seres humanos representamos el espacio. Tenemos una noción abstracta de la distancia pero aun así somos capaces de determinar dónde estamos usando la visión para identificar lugares distintos y la relación que existe entre ellos. Por consiguiente, si no necesitamos responder a la pregunta “¿Dónde estoy?” con precisión de milímetros y grados, ¿por qué debería un robot? Más aún, en aquellas aplicaciones en las que el robot y los seres humanos tengan que interactuar, el mapa debería ser idealmente un medio de comunicación común, por lo que cuanto más se parezca a la forma en que estructuramos la información espacial, más fácil debería ser dicha interacción. Esta tesis propone una solución al problema de la localización y mapeado simultáneos inspirado por el comportamiento humano. Se utiliza una cámara frontal como único sensor para hacer el sistema compatible con un amplio abanico de plataformas robóticas. Por medio de visión artificial, el robot extrae un conjunto de características complementarias (bordes verticales, información de color y puntos característicos) que se centran tanto en aspectos generales de la escena como en los detalles, y emplea un nuevo procedimiento para determinar la correspondencia entre características que se asienta en conceptos tomados del campo del procesamiento de lenguajes naturales. Las citadas características se utilizan para identificar automáticamente lugares susceptibles de ser considerados una ubicacion en el mapa, mediante un algoritmo de segmentación basado en la conectividad algebraica de un grafo. Cada vez que el robot llega a un nuevo lugar, se emplea una formulación bayesiana para decidir si la ubicación es nueva o se encuentra en una ya conocida, con el fin de actualizar el mapa topológico en consecuencia. Como evaluar continuamente todas las posibles combinaciones es computacionalmente inviable, se usa un filtro de partículas para tener en cuenta sólo aquellas topologías más probables.

The last few years have seen a great leap forward towards autonomous mobile robots, and it is just a matter of time that they become a regular part of our lives. However, some fundamental problems need to be addressed before a robot can be assigned to any particular high-level application. One of these challenges is to provide the robot with the ability of localizing itself in a previously unvisited environment without having to supply it with a map of the area in advance. Therefore, when the robot is moved to a new area, it should incrementally build a map on its own and determine its position within it. This is known as the Simultaneous Localization And Mapping (SLAM) problem. The map can be a very precise metric model or, alternatively, follow a topological approach that resembles human beings’ more intuitive representation of space. We have an abstract notion of distance but are still able to determine where we are using vision to identify distinct places and the transitions among them. Hence, if we do not need to answer the question “Where am I?” with precision of millimeters and degrees, why should a robot? Moreover, in those applications in which a robot and human beings need to interact, the map should ideally be a common communication framework, so the more similar the map is to the way we structure spatial information, the easier interaction would be. This thesis proposes a relatively computationally inexpensive solution to the SLAM problem inspired by human behavior. A forward-facing camera is the only sensor employed to make the system easily portable to a wide range of robotic platforms. By means of computer vision, the robot extracts a complementary collection of cues (vertical edges, color information, and keypoints) that focus both on the general characteristics of the scene and on the details, and employs a novel matching procedure that builds on concepts borrowed from the natural language processing field. These features are then used to automatically identify qualitatively different locations that are susceptible of being considered a place in the map using an online topological segmentation algorithm based on the algebraic connectivity of graphs. Every time the robot arrives at a place, a Bayesian formulation is employed to decide if the robot is in a new or an already known location in order to update the topological map accordingly. As keeping track of all possibilities over time is computationally intractable, a particle filter is used to take only the most probable topologies into account.

URI

http://hdl.handle.net/11531/48035

Tesis Doctoral

Indoor topological slam using frontal computer vision

Titulación / Programa

Programa de Doctorado en Modelado de Sistemas de Ingeniería

Materias/ UNESCO

33 Ciencias tecnológicas
3304 Tecnología de los ordenadores
330417 Sistemas en tiempo real
22 Física
2209 Óptica
220990 Tratamiento digital. Imágenes

Palabras Clave

Visión artificial, Detección y correspondencia de características, Robots móviles, SLAM, Modelado topológico del entorno
Computer vision, Feature detection and matching, Mobile robots, SLAM, Topological modeling of the environment

Collections

Tesis Doctorales

The following license files are associated with this item:

Creative Commons

Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 United States