# Introduction and Motivation

The document describes a project completed as part of [Altdeep's Causal Modeling in Machine Learning Workshop](https://altdeep.ai/p/causal-ml), taught by Robert Osazuwa Ness.

The full code of this work can be found at

{% embed url="<https://github.com/robertness/causalML/tree/master/projects/causal_scene_generation>" %}
Causal Scene Generation Codebase
{% endembed %}

## Motivation

The following figure describes Scott McCloud's "Picture Plane", which first appeared in his 1994 book *Understanding Comics: The Invisible Art*.

![Images decomposed to Picture Plane and Language Plane](https://1424433991-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-MFBM73ocV8YCzRKuMr4%2F-MFXhYh20fAllReZFLJi%2F-MFYcjzjhPDXS82kfBzp%2Foverview.png?alt=media\&token=15bafe85-a9c5-4d9a-9c54-e529e874fd45)

The image describes two axes for simplifying a real-world image.  On one axis, we simplify the image into abstract geometric shapes.  The bottom axis simplifies a real image into symbols that are meaningful to humans.  As we move down that axis, a face becomes more symbolic -- it only preserves elements that are meaningful to humans, in terms of having clear markers of gender and emotional expression.  Ultimately, it crosses the line into the written word, which is purely symbolic and non-pictorial.  In supervised machine learning, we'd call this a label.

This figure is interesting because it gives a clear division of labor between the modeler and deep learning.  Deep learning is good at composing geometric primitives into realistic images, humans are good at conceptualizing and representing what is meaningful about a realistic image.

In this tutorial, we use a variational autoencoder infrastructure to build a causal computer vision model.  We model the causal-effect relationships of the system explicitly and rely on the decoder to handle the geometric abstractions.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://linkinnation1792.gitbook.io/causal-scene-generation/master.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
