Skip to main content

2024 | OriginalPaper | Buchkapitel

Offline Deep Model Predictive Control (MPC) for Visual Navigation

verfasst von : Taha Bouzid, Youssef Alj

Erschienen in: Robotics, Computer Vision and Intelligent Systems

Verlag: Springer Nature Switzerland

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, we propose a new visual navigation method based on a single RGB perspective camera. Using the Visual Teach & Repeat (VT &R) methodology [8], the robot acquires a visual trajectory consisting of multiple subgoal images in the teaching step. In the repeat step, we propose two network architectures, namely ViewNet and VelocityNet. The combination of the two networks allows the robot to follow the visual trajectory. ViewNet is trained to generate a future image based on the current view and the velocity command. The generated future image is combined with the subgoal image for training VelocityNet. We develop an offline Model Predictive Control (MPC) policy within VelocityNet with the dual goals of (1) reducing the difference between current and subgoal images and (2) ensuring smooth trajectories by mitigating velocity discontinuities. Offline training conserves computational resources, making it a more suitable option for scenarios with limited computational capabilities, such as embedded systems. We validate our experiments in a simulation environment, demonstrating that our model can effectively minimize the metric error between real and played trajectories.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Chaumette, F., Hutchinson, S.: Visual servo control. I. Basic approaches. IEEE Robot. Autom. Mag. 13(4), 82–90 (2006) Chaumette, F., Hutchinson, S.: Visual servo control. I. Basic approaches. IEEE Robot. Autom. Mag. 13(4), 82–90 (2006)
2.
Zurück zum Zitat Chung, C.M., et al.: Orbeez-SLAM: a real-time monocular visual SLAM with ORB features and NeRF-realized mapping. In: 2023 IEEE International Conference on Robotics and Automation (ICRA), pp. 9400–9406. IEEE (2023) Chung, C.M., et al.: Orbeez-SLAM: a real-time monocular visual SLAM with ORB features and NeRF-realized mapping. In: 2023 IEEE International Conference on Robotics and Automation (ICRA), pp. 9400–9406. IEEE (2023)
3.
Zurück zum Zitat Diosi, A., Remazeilles, A., Segvic, S., Chaumette, F.: Outdoor visual path following experiments. In: 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 4265–4270. IEEE (2007) Diosi, A., Remazeilles, A., Segvic, S., Chaumette, F.: Outdoor visual path following experiments. In: 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 4265–4270. IEEE (2007)
4.
Zurück zum Zitat Diosi, A., Segvic, S., Remazeilles, A., Chaumette, F.: Experimental evaluation of autonomous driving based on visual memory and image-based visual servoing. IEEE Trans. Intell. Transp. Syst. 12(3), 870–883 (2011)CrossRef Diosi, A., Segvic, S., Remazeilles, A., Chaumette, F.: Experimental evaluation of autonomous driving based on visual memory and image-based visual servoing. IEEE Trans. Intell. Transp. Syst. 12(3), 870–883 (2011)CrossRef
5.
Zurück zum Zitat Eysenbach, B., Salakhutdinov, R.R., Levine, S.: Search on the replay buffer: bridging planning and reinforcement learning. In: Wallach, H., Larochelle, H., Beygelzimer, A., d’ Alché-Buc, F., Fox, E., Garnett, R. (eds.) Advances in Neural Information Processing Systems, vol. 32. Curran Associates, Inc. (2019) Eysenbach, B., Salakhutdinov, R.R., Levine, S.: Search on the replay buffer: bridging planning and reinforcement learning. In: Wallach, H., Larochelle, H., Beygelzimer, A., d’ Alché-Buc, F., Fox, E., Garnett, R. (eds.) Advances in Neural Information Processing Systems, vol. 32. Curran Associates, Inc. (2019)
6.
Zurück zum Zitat Felton, S., Fromont, É., Marchand, E.: Deep metric learning for visual servoing: when pose and image meet in latent space. In: IEEE International Conference on Robotics and Automation (2023) Felton, S., Fromont, É., Marchand, E.: Deep metric learning for visual servoing: when pose and image meet in latent space. In: IEEE International Conference on Robotics and Automation (2023)
7.
Zurück zum Zitat Finn, C., Levine, S.: Deep visual foresight for planning robot motion. In: 2017 IEEE International Conference on Robotics and Automation (ICRA), pp. 2786–2793. IEEE (2017) Finn, C., Levine, S.: Deep visual foresight for planning robot motion. In: 2017 IEEE International Conference on Robotics and Automation (ICRA), pp. 2786–2793. IEEE (2017)
8.
Zurück zum Zitat Furgale, P., Barfoot, T.D.: Visual teach and repeat for long-range rover autonomy. J. Field Robot. 27(5), 534–560 (2010)CrossRef Furgale, P., Barfoot, T.D.: Visual teach and repeat for long-range rover autonomy. J. Field Robot. 27(5), 534–560 (2010)CrossRef
9.
Zurück zum Zitat Gupta, S., Davidson, J., Levine, S., Sukthankar, R., Malik, J.: Cognitive mapping and planning for visual navigation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2616–2625 (2017) Gupta, S., Davidson, J., Levine, S., Sukthankar, R., Malik, J.: Cognitive mapping and planning for visual navigation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2616–2625 (2017)
10.
Zurück zum Zitat Hirose, N., Sadeghian, A., Vázquez, M., Goebel, P., Savarese, S.: GONet: a semi-supervised deep learning approach for traversability estimation. In: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 3044–3051 (2018). https://doi.org/10.1109/IROS.2018.8594031 Hirose, N., Sadeghian, A., Vázquez, M., Goebel, P., Savarese, S.: GONet: a semi-supervised deep learning approach for traversability estimation. In: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 3044–3051 (2018). https://​doi.​org/​10.​1109/​IROS.​2018.​8594031
11.
Zurück zum Zitat Hirose, N., Sadeghian, A., Xia, F., Martín-Martín, R., Savarese, S.: VUNet: dynamic scene view synthesis for traversability estimation using an RGB camera. IEEE Robot. Autom. Lett. 4(2), 2062–2069 (2019)CrossRef Hirose, N., Sadeghian, A., Xia, F., Martín-Martín, R., Savarese, S.: VUNet: dynamic scene view synthesis for traversability estimation using an RGB camera. IEEE Robot. Autom. Lett. 4(2), 2062–2069 (2019)CrossRef
12.
Zurück zum Zitat Hirose, N., Xia, F., Martín-Martín, R., Sadeghian, A., Savarese, S.: Deep visual MPC-policy learning for navigation. IEEE Robot. Autom. Lett. 4(4), 3184–3191 (2019)CrossRef Hirose, N., Xia, F., Martín-Martín, R., Sadeghian, A., Savarese, S.: Deep visual MPC-policy learning for navigation. IEEE Robot. Autom. Lett. 4(4), 3184–3191 (2019)CrossRef
13.
Zurück zum Zitat Kwon, O., Kim, N., Choi, Y., Yoo, H., Park, J., Oh, S.: Visual graph memory with unsupervised representation for visual navigation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 15890–15899 (2021) Kwon, O., Kim, N., Choi, Y., Yoo, H., Park, J., Oh, S.: Visual graph memory with unsupervised representation for visual navigation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 15890–15899 (2021)
14.
Zurück zum Zitat MG, H.N., Antony, R.: MPC based path planning for wheeled mobile robots in environments with varying slip. In: Advances in Robotics - 5th International Conference of The Robotics Society, AIR 2021. Association for Computing Machinery, New York (2022). https://doi.org/10.1145/3478586.3478601 MG, H.N., Antony, R.: MPC based path planning for wheeled mobile robots in environments with varying slip. In: Advances in Robotics - 5th International Conference of The Robotics Society, AIR 2021. Association for Computing Machinery, New York (2022). https://​doi.​org/​10.​1145/​3478586.​3478601
16.
Zurück zum Zitat Shah, D., Bhorkar, A., Leen, H., Kostrikov, I., Rhinehart, N., Levine, S.: Offline reinforcement learning for visual navigation. In: 6th Annual Conference on Robot Learning (2022) Shah, D., Bhorkar, A., Leen, H., Kostrikov, I., Rhinehart, N., Levine, S.: Offline reinforcement learning for visual navigation. In: 6th Annual Conference on Robot Learning (2022)
17.
Zurück zum Zitat Tanguy, A., De Simone, D., Comport, A.I., Oriolo, G., Kheddar, A.: Closed-loop MPC with dense visual SLAM-stability through reactive stepping. In: 2019 International Conference on Robotics and Automation (ICRA), pp. 1397–1403. IEEE (2019) Tanguy, A., De Simone, D., Comport, A.I., Oriolo, G., Kheddar, A.: Closed-loop MPC with dense visual SLAM-stability through reactive stepping. In: 2019 International Conference on Robotics and Automation (ICRA), pp. 1397–1403. IEEE (2019)
19.
Zurück zum Zitat Zhao, W., Queralta, J.P., Westerlund, T.: Sim-to-real transfer in deep reinforcement learning for robotics: a survey. In: 2020 IEEE Symposium Series on Computational Intelligence (SSCI), pp. 737–744. IEEE (2020) Zhao, W., Queralta, J.P., Westerlund, T.: Sim-to-real transfer in deep reinforcement learning for robotics: a survey. In: 2020 IEEE Symposium Series on Computational Intelligence (SSCI), pp. 737–744. IEEE (2020)
20.
Zurück zum Zitat Zhu, Y., et al.: Target-driven visual navigation in indoor scenes using deep reinforcement learning. In: 2017 IEEE International Conference on Robotics and Automation (ICRA), pp. 3357–3364. IEEE (2017) Zhu, Y., et al.: Target-driven visual navigation in indoor scenes using deep reinforcement learning. In: 2017 IEEE International Conference on Robotics and Automation (ICRA), pp. 3357–3364. IEEE (2017)
21.
Zurück zum Zitat Zhu, Z., et al.: Nice-SLAM: neural implicit scalable encoding for SLAM. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12786–12796 (2022) Zhu, Z., et al.: Nice-SLAM: neural implicit scalable encoding for SLAM. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12786–12796 (2022)
Metadaten
Titel
Offline Deep Model Predictive Control (MPC) for Visual Navigation
verfasst von
Taha Bouzid
Youssef Alj
Copyright-Jahr
2024
DOI
https://doi.org/10.1007/978-3-031-59057-3_9

Premium Partner