vision; namely, finding a causal representation of the visual inputs while using contextual information

7 cups of water to liters

7 cups of water to litres

7 cups of water equals how many liters

7 cups equals how many liters

7 cups of water is how many liters
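
The queries above all ask for the same arithmetic, so a small sketch of the conversion may help. The source does not say which cup is meant, so the three definitions below are assumptions: the US customary cup (1/16 US gallon), the US legal cup used on nutrition labels (240 mL), and the metric cup (250 mL). The names `LITERS_PER_CUP` and `cups_to_liters` are hypothetical, chosen only for this illustration.

```python
# Liters per cup under three common definitions.
# Which one applies is an assumption; the source does not specify.
LITERS_PER_CUP = {
    "us_customary": 0.2365882365,  # 1/16 of a US gallon (3.785411784 L)
    "us_legal": 0.240,             # US nutrition-labelling cup
    "metric": 0.250,               # metric cup
}


def cups_to_liters(cups, cup="us_customary"):
    """Convert a volume in cups to liters for the chosen cup definition."""
    return cups * LITERS_PER_CUP[cup]


if __name__ == "__main__":
    for name in LITERS_PER_CUP:
        print(f"7 cups ({name}) = {cups_to_liters(7, name):.3f} L")
```

Under these assumptions, 7 cups comes to roughly 1.66 L (US customary), 1.68 L (US legal), or 1.75 L (metric). Because this is a volume conversion, the result does not depend on the liquid being water.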