The previous sections have explained the _differentiable physics_ approach for deep learning, and have given a range of examples: from a very basic gradient calculation, all the way to complex learning setups powered by advanced simulations. This is a good time to take a step back and evaluate: in the end, the differentiable physics components of these approaches are not too complicated. They are largely based on existing numerical methods, with a focus on efficiently using those methods not only to do a forward simulation, but also to compute gradient information.
What is most exciting in this context are the implications that arise from combining these numerical methods with deep learning.
Most importantly, training via differentiable physics allows us to seamlessly bring the two fields together:
we can obtain _hybrid_ methods that use the best numerical techniques we have at our disposal for the simulation itself, as well as for the training process. We can then use the trained model to improve forward or backward solves. Thus, in the end, we obtain a solver that combines a _traditional_ numerical scheme with a _learned_ component, and together the two extend the capabilities of purely numerical methods.
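To make the structure of such a hybrid concrete, here is a minimal sketch that combines a simple differentiable solver step (explicit 1D diffusion) with a small network that adds a learned correction. The names `diffusion_step`, `correction_net`, and `hybrid_step` are illustrative placeholders rather than phiflow API; JAX is used so that the combined step remains differentiable end to end.

```python
import jax.numpy as jnp

def diffusion_step(u, dt=0.1, dx=1.0):
    """Explicit finite-difference step for 1D diffusion on a periodic domain."""
    laplacian = (jnp.roll(u, -1) - 2.0 * u + jnp.roll(u, 1)) / dx**2
    return u + dt * laplacian

def correction_net(params, u):
    """Tiny fully-connected network producing an additive correction."""
    h = jnp.tanh(params["w1"] @ u + params["b1"])
    return params["w2"] @ h + params["b2"]

def hybrid_step(params, u):
    """One hybrid update: traditional solver step plus learned correction."""
    return diffusion_step(u) + correction_net(params, u)
```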
One key aspect for these hybrids to work well is to let the NN _interact_ with the PDE solver at training time. Differentiable simulations allow a trained model to "explore and experience" the physical environment, and to receive directed feedback regarding its interactions throughout the solver iterations. This combination fits nicely into the broader context of machine learning as _differentiable programming_.
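The sketch below illustrates this interaction for the hybrid step from above: the loss is evaluated only after several unrolled solver iterations, so backpropagation traverses every solver call and the network receives gradient feedback on how its corrections propagate through the simulation. The reference trajectory `u_ref` and the parameter initialization are placeholder assumptions for illustration.

```python
import jax
import jax.numpy as jnp

def rollout_loss(params, u0, u_ref):
    """MSE against reference states, accumulated over an unrolled rollout."""
    u, loss = u0, 0.0
    for t in range(u_ref.shape[0]):
        u = hybrid_step(params, u)             # differentiable solver + NN step
        loss += jnp.mean((u - u_ref[t]) ** 2)  # feedback at every iteration
    return loss / u_ref.shape[0]

# Hypothetical setup: random network weights and a placeholder reference.
key1, key2 = jax.random.split(jax.random.PRNGKey(0))
n, hidden = 32, 64
params = {
    "w1": 0.1 * jax.random.normal(key1, (hidden, n)), "b1": jnp.zeros(hidden),
    "w2": 0.1 * jax.random.normal(key2, (n, hidden)), "b2": jnp.zeros(n),
}
u0 = jnp.sin(jnp.linspace(0.0, 2.0 * jnp.pi, n, endpoint=False))
u_ref = jnp.stack([diffusion_step(u0)] * 8)  # placeholder targets only

# One gradient evaluation: the gradient flows through all unrolled steps.
grads = jax.grad(rollout_loss)(params, u0, u_ref)
```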
The hybrid approach also bears particular promise for simulators: it improves the generalization capabilities of the trained models by letting the PDE solver handle large-scale _changes to the data distribution_, so that the learned model can focus on localized structures that are not captured by the discretization. While physical models generalize very well, learned models often specialize in the data distributions seen at training time. This was shown, e.g., for the models reducing numerical errors in the previous chapter: the trained models can deal with solution manifolds containing significant amounts of varying physical behavior, while simpler training variants quickly deteriorate over the course of recurrent time steps.
_Outlook_: one drawback of DP training is that it requires relatively heavy machinery: the simulator itself needs to provide gradients, which typically means implementing it within a framework that supports automatic differentiation. This situation is bound to improve strongly given the current pace of software and API developments in the DL area. For now, however, it's important to keep in mind that not every simulator is suitable for DP training out of the box. Hence, in this book we'll focus on examples using phiflow, which was designed for interfacing with deep learning frameworks.
To summarize, training via differentiable physics (DP) allows us to integrate full numerical simulations into the training of deep neural networks.
It is also a very generic approach that is applicable to a wide range of combinations of PDE-based models and deep learning.
In the next chapters, we will first compare DP training to model-free alternatives for control problems, and afterwards target the underlying learning process to obtain even better NN states.