Latest revision as of 17:51, 2 September 2022

Screenshot from visual odometry demonstration. Red squares are camera locations and colorized dots are part of sparse reconstruction used to estimate motion.

Stereo visual odometry estimates the camera's egomotion using a pair of calibrated cameras. Stereo camera systems are inherently more stable than monocular ones because the stereo pair provides good triangulation of image features and resolves the scale ambiguity. The example below shows how to use a high level interface with visual odometry algorithms. The basic high level interface hides internal data structures which are useful in many applications, which is why an optional interface is provided for accessing some of those structures.

Example File: ExampleVisualOdometryStereo.java

Concepts:

Structure from Motion
Geometric Vision
Feature Tracking

Relevant Videos:

Example Code

/**
 * Bare bones example showing how to estimate the camera's ego-motion using a stereo camera system. Additional
 * information on the scene can be optionally extracted from the algorithm if it implements AccessPointTracks3D.
 *
 * @author Peter Abeles
 */
public class ExampleVisualOdometryStereo {
	public static void main( String[] args ) {
		MediaManager media = DefaultMediaManager.INSTANCE;

		String directory = UtilIO.pathExample("vo/backyard/");

		// load camera description and the video sequence
		StereoParameters stereoParam = CalibrationIO.load(media.openFile(new File(directory, "stereo.yaml").getPath()));
		SimpleImageSequence<GrayU8> video1 = media.openVideo(
				new File(directory, "left.mjpeg").getPath(), ImageType.single(GrayU8.class));
		SimpleImageSequence<GrayU8> video2 = media.openVideo(
				new File(directory, "right.mjpeg").getPath(), ImageType.single(GrayU8.class));

		var config = new ConfigStereoMonoTrackPnP();

		// Specify which tracker and how it will behave
		config.tracker.typeTracker = ConfigPointTracker.TrackerType.KLT;
		config.tracker.klt.pyramidLevels = ConfigDiscreteLevels.levels(4);
		config.tracker.klt.templateRadius = 4;
		config.tracker.klt.toleranceFB = 3;
		config.tracker.klt.pruneClose = true;

		config.tracker.detDesc.detectPoint.type = PointDetectorTypes.SHI_TOMASI;
		config.tracker.detDesc.detectPoint.shiTomasi.radius = 4;
		config.tracker.detDesc.detectPoint.general.maxFeatures = 300;
		config.tracker.detDesc.detectPoint.general.radius = 5;

		// We will estimate the location of features using block matching stereo
		config.disparity.errorType = DisparityError.CENSUS;
		config.disparity.disparityMin = 0;
		config.disparity.disparityRange = 50;
		config.disparity.regionRadiusX = 3;
		config.disparity.regionRadiusY = 3;
		config.disparity.maxPerPixelError = 30;
		config.disparity.texture = 0.05;
		config.disparity.validateRtoL = 1;
		config.disparity.subpixel = true;

		// Configurations related to how the structure is chained together frame to frame
		config.scene.keyframes.geoMinCoverage = 0.4;
		config.scene.ransac.iterations = 200;
		config.scene.ransac.inlierThreshold = 1.0;

		// Declare each component then visual odometry
		StereoVisualOdometry<GrayU8> visodom = FactoryVisualOdometry.stereoMonoPnP(config, GrayU8.class);

		// Optionally dump verbose debugging information to stdout
//		visodom.setVerbose(System.out, BoofMiscOps.hashSet(BoofVerbose.RUNTIME, VisualOdometry.VERBOSE_TRACKING));

		// Pass in intrinsic/extrinsic calibration. This can be changed in the future.
		visodom.setCalibration(stereoParam);

		// Process the video sequence and output the location plus number of inliers
		long startTime = System.nanoTime();
		while (video1.hasNext()) {
			GrayU8 left = video1.next();
			GrayU8 right = video2.next();

			if (!visodom.process(left, right)) {
				throw new RuntimeException("VO Failed!");
			}

			Se3_F64 leftToWorld = visodom.getCameraToWorld();
			Vector3D_F64 T = leftToWorld.getT();

			System.out.printf("Location %8.2f %8.2f %8.2f, %s\n", T.x, T.y, T.z, trackStats(visodom));
		}
		System.out.printf("FPS %4.2f\n", video1.getFrameNumber()/((System.nanoTime() - startTime)*1e-9));
	}

	/**
	 * If the algorithm implements AccessPointTracks3D create a string which summarizing different tracking information
	 */
	public static String trackStats( VisualOdometry alg ) {
		if (!(alg instanceof AccessPointTracks3D))
			return "";

		var access = (AccessPointTracks3D)alg;

		int N = access.getTotalTracks();
		int totalInliers = 0;
		int totalNew = 0;
		for (int i = 0; i < N; i++) {
			if (access.isTrackInlier(i))
				totalInliers++;

			if (access.isTrackNew(i))
				totalNew++;
		}

		return String.format("inlier: %5.1f%% new %4d total %d", 100.0*totalInliers/N, totalNew, N);
	}
}

Difference between revisions of "Example Stereo Visual Odometry"

Latest revision as of 17:51, 2 September 2022

Example Code

Navigation menu

Page actions

Page actions

Personal tools

Navigation

Search

Tools

@@ Line 1: / Line 1: @@
-= Stereo Visual Odometry Example =
 <center>
 <gallery widths=758px heights=384px >
-File:Stereo_visual_odometry_screenshot.jpg|Screenshot from visual odometry applet.  Example just outputs estimated location.
+File:Stereo_visual_odometry_screenshot.jpg|Screenshot from visual odometry demonstration.  Red squares are camera locations and colorized dots are part of sparse reconstruction used to estimate motion.
 </gallery>
 </center>
-Stereo visual odometry estimates the camera's egomotion using a pair of calibrated cameras.  Stereo camera systems are inheritly more stable than monocular ones because the stereo pair provides good triangulation of image features and resolves the scale ambiguity.   The example below shows how to use a high level interface with visual odometyr algorithms.  The basic high level interface hides internal data structures which are useful in many applications, which is why an optional interface is provided for accessing some of those structures.f point features and model fitting is required.
+Stereo visual odometry estimates the camera's egomotion using a pair of calibrated cameras.  Stereo camera systems are inherently more stable than monocular ones because the stereo pair provides good triangulation of image features and resolves the scale ambiguity.   The example below shows how to use a high level interface with visual odometry algorithms.  The basic high level interface hides internal data structures which are useful in many applications, which is why an optional interface is provided for accessing some of those structures.
-Example File: [https://github.com/lessthanoptimal/BoofCV/blob/master/examples/src/boofcv/examples/ExampleStereoVisualOdometry.java ExampleStereoVisualOdometry.java]
+Example File: [https://github.com/lessthanoptimal/BoofCV/blob/v0.41/examples/src/main/java/boofcv/examples/sfm/ExampleVisualOdometryStereo.java ExampleVisualOdometryStereo.java]
 Concepts:
@@ Line 16: / Line 14: @@
 * Feature Tracking
-Relevant Applets:
+Relevant Videos:
-* [[Applet_Feature_Tracking| Feature Tracking]]
+* [https://www.youtube.com/watch?v=8pn9Ebw90uk&t=292s Video 2020]
-* [[Applet_Stereo_Visual_Odometry| Stereo Visual Odometry]]
+* [https://www.youtube.com/watch?v=D4I4NHSGaOc Video 2013]
 Related Tutorials/Example Code:
 * [[Tutorial_Geometric_Vision| Tutorial Geometric Vision]]
+* [[Example Visual Odometry Monocular Plane| Visual Odometry Monocular Plane]]
+* [[Example Visual Odometry Depth| Visual Odometry Depth]]
+= Example Code =
 <syntaxhighlight lang="java">
@@ Line 30: / Line 32: @@
   * @author Peter Abeles
   */
-public class ExampleStereoVisualOdometry {
+public class ExampleVisualOdometryStereo {
+	public static void main( String[] args ) {
+		MediaManager media = DefaultMediaManager.INSTANCE;
-	public static void main( String args[] ) {
+		String directory = UtilIO.pathExample("vo/backyard/");
-		MediaManager media = DefaultMediaManager.INSTANCE;
+		// load camera description and the video sequence
+		StereoParameters stereoParam = CalibrationIO.load(media.openFile(new File(directory, "stereo.yaml").getPath()));
+		SimpleImageSequence<GrayU8> video1 = media.openVideo(
+				new File(directory, "left.mjpeg").getPath(), ImageType.single(GrayU8.class));
+		SimpleImageSequence<GrayU8> video2 = media.openVideo(
+				new File(directory, "right.mjpeg").getPath(), ImageType.single(GrayU8.class));
+		var config = new ConfigStereoMonoTrackPnP();
+		// Specify which tracker and how it will behave
+		config.tracker.typeTracker = ConfigPointTracker.TrackerType.KLT;
+		config.tracker.klt.pyramidLevels = ConfigDiscreteLevels.levels(4);
+		config.tracker.klt.templateRadius = 4;
+		config.tracker.klt.toleranceFB = 3;
+		config.tracker.klt.pruneClose = true;
-		String directory = "../data/applet/vo/backyard/";
+		config.tracker.detDesc.detectPoint.type = PointDetectorTypes.SHI_TOMASI;
+		config.tracker.detDesc.detectPoint.shiTomasi.radius = 4;
+		config.tracker.detDesc.detectPoint.general.maxFeatures = 300;
+		config.tracker.detDesc.detectPoint.general.radius = 5;
-		// load camera description and the video sequence
+		// We will estimate the location of features using block matching stereo
-		StereoParameters config = BoofMiscOps.loadXML(media.openFile(directory+"stereo.xml"));
+		config.disparity.errorType = DisparityError.CENSUS;
-		SimpleImageSequence<ImageUInt8> video1 = media.openVideo(directory+"left.mjpeg",ImageUInt8.class);
+		config.disparity.disparityMin = 0;
-		SimpleImageSequence<ImageUInt8> video2 = media.openVideo(directory+"right.mjpeg",ImageUInt8.class);
+		config.disparity.disparityRange = 50;
+		config.disparity.regionRadiusX = 3;
+		config.disparity.regionRadiusY = 3;
+		config.disparity.maxPerPixelError = 30;
+		config.disparity.texture = 0.05;
+		config.disparity.validateRtoL = 1;
+		config.disparity.subpixel = true;
-		// specify how the image features are going to be tracked
+		// Configurations related to how the structure is chained together frame to frame
-		ImagePointTracker<ImageUInt8> tracker =
+		config.scene.keyframes.geoMinCoverage = 0.4;
-				FactoryPointSequentialTracker.klt(600,new int[]{1,2,4,8},3,3,2,ImageUInt8.class, ImageSInt16.class);
+		config.scene.ransac.iterations = 200;
+		config.scene.ransac.inlierThreshold = 1.0;
-		// computes the depth of each point
+		// Declare each component then visual odometry
-		StereoDisparitySparse<ImageUInt8> disparity =
+		StereoVisualOdometry<GrayU8> visodom = FactoryVisualOdometry.stereoMonoPnP(config, GrayU8.class);
-				FactoryStereoDisparity.regionSparseWta(0, 150, 3, 3, 30, -1, true, ImageUInt8.class);
-		// declares the algorithm
+		// Optionally dump verbose debugging information to stdout
-		StereoVisualOdometry<ImageUInt8> visualOdometry = FactoryVisualOdometry.stereoDepth(120, 2,
+//		visodom.setVerbose(System.out, BoofMiscOps.hashSet(BoofVerbose.RUNTIME, VisualOdometry.VERBOSE_TRACKING));
-.5, tracker, disparity, 0, ImageUInt8.class);
-		// Pass in intrinsic/extrinsic calibration.  This can be changed in the future.
+		// Pass in intrinsic/extrinsic calibration. This can be changed in the future.
-		visualOdometry.setCalibration(config);
+		visodom.setCalibration(stereoParam);
 		// Process the video sequence and output the location plus number of inliers
-		while( video1.hasNext() ) {
+		long startTime = System.nanoTime();
-			ImageUInt8 left = video1.next();
+		while (video1.hasNext()) {
-			ImageUInt8 right = video2.next();
+			GrayU8 left = video1.next();
+			GrayU8 right = video2.next();
-			if( !visualOdometry.process(left,right) ) {
+			if (!visodom.process(left, right)) {
 				throw new RuntimeException("VO Failed!");
 			}
-			Se3_F64 leftToWorld = visualOdometry.getLeftToWorld();
+			Se3_F64 leftToWorld = visodom.getCameraToWorld();
 			Vector3D_F64 T = leftToWorld.getT();
-			System.out.printf("Location %8.2f %8.2f %8.2f      inliers %s\n", T.x, T.y, T.z,countInliers(visualOdometry));
+			System.out.printf("Location %8.2f %8.2f %8.2f, %s\n", T.x, T.y, T.z, trackStats(visodom));
 		}
+		System.out.printf("FPS %4.2f\n", video1.getFrameNumber()/((System.nanoTime() - startTime)*1e-9));
 	}
 	/**
-	 * If the algorithm implements AccessPointTracks3D, then count the number of inlier features
+	 * If the algorithm implements AccessPointTracks3D create a string which summarizing different tracking information
-	 * and return a string.
 	 */
-	public static String countInliers( StereoVisualOdometry alg ) {
+	public static String trackStats( VisualOdometry alg ) {
-		if( !(alg instanceof AccessPointTracks3D))
+		if (!(alg instanceof AccessPointTracks3D))
 			return "";
-		AccessPointTracks3D access = (AccessPointTracks3D)alg;
+		var access = (AccessPointTracks3D)alg;
+		int N = access.getTotalTracks();
+		int totalInliers = 0;
+		int totalNew = 0;
+		for (int i = 0; i < N; i++) {
+			if (access.isTrackInlier(i))
+				totalInliers++;
-		int count = 0;
+			if (access.isTrackNew(i))
-		int N = access.getAllTracks().size();
+				totalNew++;
-		for( int i = 0; i < N; i++ ) {
-			if( access.isInlier(i) )
-				count++;
 		}
-		return Integer.toString(count);
+		return String.format("inlier: %5.1f%% new %4d total %d", 100.0*totalInliers/N, totalNew, N);
 	}
 }
 </syntaxhighlight>