S.L.A.M. and Optical Tracking for XR

christina-wocintechchat-com-CmvA0xCDfC8-unsplash.jpg

S.L.A.M. and Optical Tracking for XR

Simultaneous Localization and Mapping is the computational problem of constructing or updating a map of an unknown environment while simultaneously keeping track of an model’s location within it. First I’ll start with the basics.

Tracking refers to the alignment of spatial properties. Registration is used to describe sensing and measurement of AR systems. For us to be a part of AR we must know where our position and orientation (pose) is in ‘real time’ and have it continuously updated. These are measured in 3-D position and 6-dimensional pose. This is also referred to as 3DoF (degrees of freedom)and 6DoF.

Registration in AR refers to the alignment to the coordinating systems between the real world and the virtual world. This requires the user’s head or camera within their smartphone, glasses, or headwear. There are 2 types of registration; static and dynamic registration. Static registration is when the user or camera is not moving requiring a calibration of the tracking system between the real world and AR. Dynamic registration is when the user or camera is moving and requires tracking.

TRACKING STYSTEMS

Stationary tracking systems were the first to become popular for virtual reality applications. As AR and VR progressed, Stationary Tracking is not as common in today’s systems, instead we use Mobile Tracking. Modern devices, smartphones in particular, have multiple sensors and access to wireless networks. While these devices already come equipped with all these sensors, they still find themselves in need of external sensors to connect with to produce an AR experience.

OPTICAL TRACKING

Optical methods of Tracking represent a set of computer vision algorithms and tracking devices such as a camera of visible or infrared range, a stereo camera, and a depth camera. Using images from these cameras requires combining them with a reference model. If a model is obtained before the tracking system, it is model-based tracking. The alternative is model-free tracking witch refers to when the model and tracking are being obtained at the same time. This is referred to as Simultaneous localization and mapping (or S.L.A.M.).

S.L.A.M. can combine 3D modeling with 3D tracking. This means that model-free tracking begins when the experience begins. With this type of tracking AR objects must be placed spontaneously and cannot be connected to the real world.

Markers and Natural features

When we talk about AR objects, we separate them into two categories; a marker or a natural feature. A marker or fiducial, help provide tracking for objects that lack variety in color and/or textures or repeating patterns that make it difficult for sensors differentiate between multiple features. Markers can come in different shapes and sizes but mostly are either round or square and provide high contrast typically in black and white.

Markers or fiducial


Markers or fiducial

Sphere shaped markers tend to have a greater advantage because they are the same shape as a lens or disc. The sphere is also ideal for tracking a user or other objects that are agile and change positions and orientation to the camera. They also have one main focal point as opposed to a quadrangular shape that requires a minimum of 3 focal points. The square design is common for barcodes and other stand alone objects of identification.

Alternatively, retro-reflective foil can be constructed as a markers. This material is commonly used in safety belts and on other safety clothing items and produces high contrast. This material is placed on spherical markers and placed around the camera to act as a high contrast markers.

Sometimes developers do not want to use markers or simply don’t have the means to make them so they then rely on natural features to communicate with sensors. Natural feature tracking requires high image quality and more computational resources. The most common types of these can be edge features or key frames assuming the outline features are easily detectable. Key frames are used but must be from a specific vantage point.

This is a natural feature in which the sensors from your phone detect the natural edge of the milk carton lid to provide an AR experience.

This is a natural feature in which the sensors from your phone detect the natural edge of the milk carton lid to provide an AR experience.

Illumination

For all these sensors to communicate we heavily rely on illumination. Illumination can be naturally occurring, passive illumination or active illumination.

Light sources that are not an integral part of the tracking system are called passive illumination. These passive illuminations occur naturally from the sun or semi-naturally from common sources such as ceiling lights or lamps. This can be tricky as the sun can be overly bright and indoor lighting can be a bit dim making image quality and markers hard to pick up.

Active light sources are one way developers and users can avoid becoming dependent on natural light sources. Infrared LED lighting that goes undetected by humans is a great way to illuminate the scene. LED beacons can be fixed or mounted and produce a high contrast image that is simply computed.

Infrared contrast

Structured lighting is another way to illuminate the scene with laser projections, RGB color, and depth sensors work together in commonly used RGB-D cameras. These are ideal for AR/VR systems as they deliver geographic information as well as images simultaneously (S.L.A.M.).

Unlike the pixelated images of cameras as opposed to the laser’s use of measurement which calculates the travel time of the light between the actual laser and object. This allows for more precise measurements over long distances and is often used in surveying.

Kinect by xbox uses an infrared laser light projector and camera as well as a RGB camera

SUMMARY

Simultaneous Localization and Mapping or S.L.A.M. is how we use track a model or user in the real world while simultaneously mapping out one’s environment to provide an AR experience. A huge part of this is with the use of optical tracking. With the use of markers and natural features we are able to communicate with sensors. These markers commonly made in to a square or circle. Natural feature tracking relies on hard edges and key frames to connect. Illumination is important to improve the communication between markers, sensors, and the camera. LED infrared lighting provides a high contrast image that is able to compute and register. Many tracking technologies come with a infrared laser, RGB or color camera, and an infrared camera and combine all inputs for the best S.L.A.M. tracking experience.





charissa suhr