Technische
            Universität Braunschweig
  • Homepage
  • Team
  • Research
    • Group Ernst
    • Group Jukan
    • Group Michalik
    • AG Gomez
    • TUBS.digital
  • Teaching
    • Courses
    • Student research
    • Seminar topics
    • International Internships
  • Job offers
    • DE
    • EN
Logo Institut für Datentechnik und Kommunikationsnetze der TU Braunschweig
  • IPF 2.0
    • Motivation
    • Role of IDA
    • Project partners
    • People at IDA
    • Project-related publications

IPF 2.0


Conquering MPSoC Complexity with Principles of a Self-Aware Information Processing Factory (IPF) – Towards networked and data-centric Platforms

The IPF project is a transatlantic collaboration among University of California, Irvine, TU Munich and TU Braunschweig, jointly funded by the Deutsche Forschungsgemeinschaft (German Research Foundation - DFG) and the US National Science Foundation (NSF).

You can find the website of our partners here.
Official DFG project summary

IPF stands for a novel concept in the management of large and complex heterogeneous Multi-Processor Systems-on-Chip (MPSoC) towards autonomously operating embedded and cyber-physical computer systems. Like a smart factory that uses a data-centric infrastructure and AI to integrate production, supply management, facility management, operation and maintenance, IPF envisions an integrated self-management of future connected MPSoC. To that end, IPF uses the principles of self-awareness and cognitive self-organization. It combines the many sources of information from sensors, application profiles and requirements (e.g., safety), expected changes and workload situations to identify potential future MPSoC system failures (“imminent hazards”) and optimize resource usage (e.g., energy) as well as system lifetimes.

IPF has 2 phases. Phase 1 (IPF 1, 2018-2022) focused on the management of an individual MPSoC.  It formulated the fundamental challenges, derived terminology and methods, and provided solutions for autonomous operation under long-term safety and availability requirements. The results take continuous alterations into account, like a changing environment, system dynamics, and degradation. Literature and press coverage can be found here. A prominent use case is a pacemaker ["The Information Processing Factory: A Paradigm for Life Cycle Management of Dependable Systems"; Eberle A. Rambo, Thawra Kadeed, Rolf Ernst, et. al.].

The second phase, IPF2.0, (2022-2025) extends the principles and applications to networks of MPSoC systems. In a first stage, multiple MPSoCs are connected via local networks, an infrastructure found in modern vehicles, in aircraft, or in robots. Such systems are still developed in a coherent design process, with a fixed network topology and a single owner. This stage can be likened to combine several production lines in a larger factory, to stay in the factory metaphor. The second stage of IPF 2.0 researches into solutions for orchestrating such local systems in larger multi-owner networks, such as known from industrial supply chains. IPF2.0 uses truck platooning as a related use case, where trucks orchestrate ad hoc to form a loosely coupled system exchanging data for joint operation in traffic.

The IPF2.0 consortium is in close contact to a Korean partner at Kookmin University, that put together a platoon demonstrator and a test environment to evaluate management abilities in a platoon. You can find a demonstrative video of the Kookmin platoon installation here.


Motivation

As mentioned above, the extension of the application field of the principles developed in IPF1.0 will take place in two stages. In the first stage, we will move from standalone multicore chips to a group of networked processors. As mentioned above, this would be equivalent to different highly automated and self-managed production lines in a factory being integrated. The context of this integration remains within a factory ground with a fixed topology and a shared owner. Applying this to networked MPSoCs, this transition poses some significant challenges to the underlying communication network. Mechanisms of continuous monitoring and self-awareness are key components to handle everchanging environmental and system-specific dynamics. As these are the central mechanisms that allow for a smart self-aware processing factory, they already dominated research in the previous IPF1.0 project on multi-/manycore architecture level. On the level of a networked platform, self-awareness can only be achieved, if data (e.g., meta data like status or reliability information, but also processed data) can be exchanged between the processors in a safe and timely manner. Therefore, the compliance to requirements like timing, performance, and correctness of the data transfer has a significant influence on the safe operation of the whole distributed system.

 

Typically, networks in safety-critical systems like state-of-the-art vehicles are statically configured as locations and availability of data producers (e.g., sensors) and data consumers (e.g., sensor fusion) is known at design time. However, adaptation to dynamic changes within the system is a central mechanism of the information processing factory idea. Metaphorically, processing factories cannot stop operation. Environmental changes and system dynamics endangering the factory process will be identified through monitoring and result in adjustments of the processing line to ensure continuous operation. Enabling a system to self-adapt and adjust its operation improves the system’s longevity significantly. Therefore, in the application field of networked MPSoCs, the network control and data distribution mechanisms shall be broadened to support system dynamics. In safety-critical systems dynamic changes in operation and configurations are not considered, as it substantially complicates safety verification and certification. The concept of a Resource Manager (initially developed in phase 1 of IPF), allows for reconfigurations of the communication network while verifiability remains, and safe operation can be guaranteed at all times.

 

In IPF1.0, a demonstrator of an in-vehicle network was put together to evaluate and verify the Resource Manager’s functionality. This demonstrator will be extended and further utilized in phase 2 of IPF.

The previously mentioned dynamics are only amplified in stage two, when we consider cooperation of distributed systems. In the processing factory metaphor, this would equal a cooperation of independent factory grounds in different locations or for different stages in the supply chain. Within the IPF2.0 consortium, truck platooning is considered as an example of such a system as already stated. Data distribution in a platoon is even more dynamic, as platoon members can join and leave the platoon frequently and with that, taking or bringing in new data producers or data consumers into a system that is connected by a lossy communication medium only. The IPF2.0 project investigates opportunities to support such dynamism in order to enable or enhance safety-critical applications such as truck platooning. Adjusting and optimizing the data transport in dynamic scenarios supports composability while allowing to improve energy efficiency and performance.


Role of IDA

The requirements on traffic in vehicular networks such as those of the trucks in the platooning use case pose as a significant challenge for future vehicular networks due to the dynamism and varying communication semantics. Meanwhile, there are still stringent safety and timing constraints that must be taken into account at all times. At the Institute for Computer and Network Engineering we develop protocols and mechanisms to address this issue.

Optimization of data management with Dynamic NUMA
First, we aim to tackle architectural challenges caused by the high timing and reliability requirements on data availability and data distribution by taking a data-centric perspective. Taking distributed shared memory architectures (with NUMA) into consideration for vehicular systems with multiple distributed ECUs, allows us to introduce caching mechanisms to be used in these distributed systems. Considering the vehicular use case, potentially large objects, such as processed or raw sensor (e.g., Camera or Lidar) data, needs to be exchanged. Limiting data transmission to the truly needed data portions only, can improve the performance but also energy efficiency of the system. A data distribution concept will be developed that allows for fragmental data updates (in line with traditional hardware data caching mechanism found in modern processors) and burst-like object caching transfers (coinciding with publish/subscribe data distribution found in DDS/RTPS). The combined caching mechanism shall adapt the data distribution pattern best fit for the situation and traffic pattern. Further, due to the vast exchange of large data objects caused by the distributed nature of future vehicular computing platforms and the sheer number of sensors in future vehicles, approaches to optimize the data locality and exchange will be examined.

Network management via Resource Manager
To address the increased dynamism we advance the concept of Resource Management (RM). Resource Management is intended for the coordination of applications and network (segments) in order to give guarantees for the data exchange that is essential for safe and timely communication. On the one hand, RM will be used to coordinate all in-vehicle traffic, including streaming and caching. Furthermore, we extend RM to enable inter-vehicle communication between the trucks in a platoon by coordinating the wireless channel.

Reliable Wireless Communications
While the coordination of data streams/traffic on the wireless channel is essential for safe operation, it does not suffice for all encompassing safety considerations. With the wireless channel being a shared medium where collisions can occur and where a diverse set of physical effects can affect transmissions, the communication over such channels is inherently lossy. As a result, without proper error protection mechanisms data exchange under timing and safety constraints as imposed by the platooning application is not possible. To address this issue and ensure safe wireless data exchange we apply DDS/RTPS-based error-protection protocols to the platooning use case. Those protocols will also interact with the RM.

 


Project partners


People at IDA

The following research staff are involved with the project at TU Braunschweig:

  • Alex Bendrick
  • Dominik Stöhrmann
  • Nora Sperling

Project-related publications

The listed material is protected by copyright. The corresponding copyright must be observed.


Rolf Ernst, Dominik Stöhrmann, Alex Bendrick and Adam Kostrzewa, "Application-centric network management–Addressing safety and real-time in V2X applications (Perspective Paper)", ACM Transactions on Embedded Computing Systems, Januar 2022.

Show bibtex code: Application-centric network management–Addressing safety and real-time in V2X applications (Perspective Paper) Display/download pdf-file: Ernst2022Application-centricnetworkmanagement–Addressing.pdf Call website/url: https://doi.org/10.1145/3528411

@article{Ernst2022Application-centricnetworkmanagement–Addressing,

author = {Rolf Ernst and Dominik Stöhrmann and Alex Bendrick and Adam Kostrzewa},
journal = {ACM Transactions on Embedded Computing Systems},
month = jan,
title = {{Application-centric network management–Addressing safety and real-time in V2X applications (Perspective Paper)}},
url = {https://doi.org/10.1145/3528411},
year = {2022}

}

Adam Kostrzewa and Rolf Ernst, "Achieving safety and performance with reconfiguration protocol for ethernet TSN in automotive systems", Elsevier Journal of Systems Architecture, vol. 118, September 2021.

Show bibtex code: Achieving safety and performance with reconfiguration protocol for ethernet TSN in automotive systems Call website/url: https://www.sciencedirect.com/science/article/abs/pii/S138376212100148X

@article{Kostrzewa2021Achievingsafetyand,

author = {Adam Kostrzewa and Rolf Ernst},
journal = {Elsevier Journal of Systems Architecture },
month = sep,
title = {{Achieving safety and performance with reconfiguration protocol for ethernet TSN in automotive systems}},
url = {https://www.sciencedirect.com/science/article/abs/pii/S138376212100148X},
volume = {118},
year = {2021}

}

Thawra Kadeed, Borislav Nikolic and Rolf Ernst, "Safe Online Reconfiguration of Mixed-Criticality Real-Time Systems" in Pacific Rim International Symposium on Dependable Computing (PRDC), (Perth, Australia), Dezember 2020.

Show bibtex code: Safe Online Reconfiguration of Mixed-Criticality Real-Time Systems

@inproceedings{Kadeed2020SafeOnlineReconfiguration,

address = {Perth, Australia},
author = {Thawra Kadeed and Borislav Nikolic and Rolf Ernst},
booktitle = {Pacific Rim International Symposium on Dependable Computing (PRDC)},
month = dec,
title = {{Safe Online Reconfiguration of Mixed-Criticality Real-Time Systems}},
year = {2020}

}

Eberle Rambo, Bryan Donyanavard, Minjun Seo, Florian Maurer, Thawra Kadeed, Caio De Melo, Biswadip Maity, Anmol Surhonne, Andreas Herkersdorf, Fadi Kurdahi, Nikil Dutt and Rolf Ernst, "The Self-Aware Information Processing Factory Paradigm for Mixed-Critical Multiprocessing", IEEE Transactions on Emerging Topics in Computing, August 2020.

Show bibtex code: The Self-Aware Information Processing Factory Paradigm for Mixed-Critical Multiprocessing

@article{RamboIEEETransactionsonEmergingTopicsinComputingTheSelf-AwareInformation,

author = {Eberle Rambo and Bryan Donyanavard and Minjun Seo and Florian Maurer and Thawra Kadeed and Caio De Melo and Biswadip Maity and Anmol Surhonne and Andreas Herkersdorf and Fadi Kurdahi and Nikil Dutt and Rolf Ernst},
journal = {IEEE Transactions on Emerging Topics in Computing},
month = aug,
title = {{The Self-Aware Information Processing Factory Paradigm for Mixed-Critical Multiprocessing}},
year = {2020}

}

Thawra Kadeed, "Dynamic Energy Management of Mixed-Criticality Real-Time Networks-on-Chip" in Design, Automation & Test in Europe Conference (DATE), (Grenoble, France), März 2020.

Show bibtex code: Dynamic Energy Management of Mixed-Criticality Real-Time Networks-on-Chip

@inproceedings{Kadeed2020DynamicEnergyManagement,

address = {Grenoble, France},
author = {Thawra Kadeed},
booktitle = {Design, Automation & Test in Europe Conference (DATE)},
month = 0,
title = {{Dynamic Energy Management of Mixed-Criticality Real-Time Networks-on-Chip}},
year = {2020}

}

Eberle A. Rambo, Robin Hapka and Rolf Ernst, "A Gem5 Multi-OS Mixed-critical Many-core Simulation Model for Self-aware Systems" in 40th IEEE Real-Time Systems Symposium (RTSS) - Demo in RTSS@Work, Dezember 2019.

Show bibtex code: A Gem5 Multi-OS Mixed-critical Many-core Simulation Model for Self-aware Systems

@inproceedings{Rambo2019AGem5Multi-OS,

author = {Eberle A. Rambo and Robin Hapka and Rolf Ernst},
booktitle = {40th IEEE Real-Time Systems Symposium (RTSS) - Demo in RTSS@Work},
month = dec,
title = {{A Gem5 Multi-OS Mixed-critical Many-core Simulation Model for Self-aware Systems}},
year = {2019}

}

Thawra Kadeed, "A Comprehensive Framework for Energy Management of Hard Real-time Networks-on-Chip" in Proceedings of the IEEE Real-Time Systems Symposium (RTSS) @ WORK, (Hong Kong, China), Dezember 2019.

Show bibtex code: A Comprehensive Framework for Energy Management of Hard Real-time Networks-on-Chip

@inproceedings{Kadeed2019AComprehensiveFramework,

address = {Hong Kong, China},
author = {Thawra Kadeed},
booktitle = {Proceedings of the IEEE Real-Time Systems Symposium (RTSS) @ WORK},
month = dec,
title = {{A Comprehensive Framework for Energy Management of Hard Real-time Networks-on-Chip}},
year = {2019}

}

Thawra Kadeed, Sebastian Tobuschat and Rolf Ernst, "Integrated Energy Control for Hard Real-time Networks-on-Chip" in Proceedings of the IEEE Real-Time Systems Symposium (RTSS), (Hong Kong, China), Dezember 2019.

Show bibtex code: Integrated Energy Control for Hard Real-time Networks-on-Chip

@inproceedings{Kadeed2019IntegratedEnergyControl,

address = {Hong Kong, China},
author = {Thawra Kadeed and Sebastian Tobuschat and Rolf Ernst},
booktitle = {Proceedings of the IEEE Real-Time Systems Symposium (RTSS)},
month = dec,
title = {{Integrated Energy Control for Hard Real-time Networks-on-Chip}},
year = {2019}

}

Eberle A. Rambo, Thawra Kadeed, Rolf Ernst, Minjun Seo, Fadi Kurdahi, Bryan Donyanavard, Caio Batista de Melo, Biswadip Maity, Kasra Moazzemi, Kenneth Stewart, Saehanseul Yi, Amir M. Rahmani, Nikil Dutt, Florian Maurer, Nguyen Anh Vu Doan, Anmol Surhonne, Thomas Wild and Andreas Herkersdorf, "The Information Processing Factory: A Paradigm for Life Cycle Management of Dependable Systems" in IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS'19), (New York, New York, USA), Oktober 2019.

Show bibtex code: The Information Processing Factory: A Paradigm for Life Cycle Management of Dependable Systems

@inproceedings{Rambo2019TheInformationProcessing,

address = {New York, New York, USA},
author = {Eberle A. Rambo and Thawra Kadeed and Rolf Ernst and Minjun Seo and Fadi Kurdahi and Bryan Donyanavard and Caio Batista de Melo and Biswadip Maity and Kasra Moazzemi and Kenneth Stewart and Saehanseul Yi and Amir M. Rahmani and Nikil Dutt and Florian Maurer and Nguyen Anh Vu Doan and Anmol Surhonne and Thomas Wild and Andreas Herkersdorf},
booktitle = {IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS'19)},
month = oct,
title = {{The Information Processing Factory: A Paradigm for Life Cycle Management of Dependable Systems}},
year = {2019}

}

Robin Hofmann, Borislav Nikolic and Rolf Ernst, "Challenges and Limitations of IEEE 802.1 CB-2017", , No. 12(4), pp. 105-108, April 2019.

Show bibtex code: Challenges and Limitations of IEEE 802.1 CB-2017

@article{Hofmann2019ChallengesandLimitations,

author = {Robin Hofmann and Borislav Nikolic and Rolf Ernst},
booktitle = {IEEE Embedded Systems Letters},
month = apr,
pages = {105-108},
title = {{Challenges and Limitations of IEEE 802.1 CB-2017}},
number = {12(4)},
year = {2019}

}

Thawra Kadeed, Sebastian Tobuschat, Adam Kostrzewa and Rolf Ernst, "Safe and efficient power management of hard real-time networks-on-chip", Integration, the VLSI Journal, Oktober 2018.

Show bibtex code:  Safe and efficient power management of hard real-time networks-on-chip Display/download pdf-file: Kadeed2018Safeand.pdf

@article{Kadeed2018Safeand,

author = {Thawra Kadeed and Sebastian Tobuschat and Adam Kostrzewa and Rolf Ernst},
journal = {Integration, the VLSI Journal},
month = oct,
note = {https://doi.org/10.1016/j.vlsi.2018.10.007},
title = {{ Safe and efficient power management of hard real-time networks-on-chip }},
year = {2018}

}

Mischa Möstl, Johannes Schlatow, Rolf Ernst, Nikil Dutt, Ahmed Nassar, Amir Rahmani, Fadi J. Kurdahi, Thomas Wild, Armin Sadighi and Andreas Herkersdorf, "Platform-Centric Self-Awareness as a Key Enabler for Controlling Changes in CPS", Proceedings of the IEEE, vol. 106, No. 9, pp. 1543-1567, September 2018.

Show bibtex code: Platform-Centric Self-Awareness as a Key Enabler for Controlling Changes in CPS Display/download pdf-file: moestl2018platformcentric.pdf Call website/url: https://doi.org/10.1109/JPROC.2018.2858023

@article{moestl2018platformcentric,

author = {Mischa Möstl and Johannes Schlatow and Rolf Ernst and Nikil Dutt and Ahmed Nassar and Amir Rahmani and Fadi J. Kurdahi and Thomas Wild and Armin Sadighi and Andreas Herkersdorf},
journal = {Proceedings of the IEEE},
month = sep,
pages = {1543-1567},
title = {{Platform-Centric Self-Awareness as a Key Enabler for Controlling Changes in CPS}},
url = {https://doi.org/10.1109/JPROC.2018.2858023},
volume = {106},
year = {2018}

}

Armin Sadighi, Bryan Donyanavard, Thawra Kadeed, Kasra Moazzemi, Tiago Mück, Ahmed Nassar, Amir M. Rahmani, Thomas Wild, Nikil Dutt, Rolf Ernst, Andreas Herkersdorf and Fadi J. Kurdahi, "Design Methodologies for Enabling Self-awareness in Autonomous Systems" in Proceedings of Design, Automation and Test in Europe Conference (DATE), März 2018, Special Session Paper.

Show bibtex code: Design Methodologies for Enabling Self-awareness in Autonomous Systems

@inproceedings{Sadighi2018DesignMethodologiesfor,

author = {Armin Sadighi and Bryan Donyanavard and Thawra Kadeed and Kasra Moazzemi and Tiago Mück and Ahmed Nassar and Amir M. Rahmani and Thomas Wild and Nikil Dutt and Rolf Ernst and Andreas Herkersdorf and Fadi J. Kurdahi},
booktitle = {Proceedings of Design, Automation and Test in Europe Conference (DATE)},
month = 0,
note = {Special Session Paper},
title = {{Design Methodologies for Enabling Self-awareness in Autonomous Systems}},
year = {2018}

}

Rolf Ernst, "Automated Driving: The Cyber-Physical Perspective", IEEE Computer, vol. 51, No. 9, pp. 76-79, September 2018.

Show bibtex code: Automated Driving: The Cyber-Physical Perspective Call website/url: https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=8481268

@article{ernst2018automated,

author = {Rolf Ernst},
journal = {IEEE Computer},
month = sep,
pages = {76-79},
title = {{Automated Driving: The Cyber-Physical Perspective}},
url = {https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=8481268},
volume = {51},
year = {2018}

}

Nikil Dutt, Fadi J. Kurdahi, Rolf Ernst and Andreas Herkersdorf, "Conquering MPSoC complexity with principles of a self-aware information processing factory" in Proceedings of the 11th IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS), (Pittsburgh, Pennsylvania), Oktober 2016, Special Session Paper.

Show bibtex code: Conquering MPSoC complexity with principles of a self-aware information processing factory Call website/url: http://ieeexplore.ieee.org/document/7750995/

@inproceedings{dutt2016conquering,

address = {Pittsburgh, Pennsylvania},
author = {Nikil Dutt and Fadi J. Kurdahi and Rolf Ernst and Andreas Herkersdorf},
booktitle = {Proceedings of the 11th IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)},
month = oct,
note = {Special Session Paper},
series = {CODES'16},
title = {{Conquering MPSoC complexity with principles of a self-aware information processing factory}},
url = {http://ieeexplore.ieee.org/document/7750995/},
year = {2016}

}

Contact

Institute of Computer and Network Engineering
Hans-Sommer-Street 66
38106 Braunschweig
Phone: +49 (0)531 391-3734
Fax: +49 (0)531 391-4587

Office hours:

Mon: 
10:15-11:30 and 14:00-15:00
Tue:   
10:15-11:30 and 14:00-15:00
Wed:   
14:00-15:00
Thu: 
10:15-11:30 and 14:00-15:00
Fri:
10:15-11:30 and 14:00-15:00
© Technische Universität Braunschweig
Legal notice / disclaimer Data Privacy Policy