A friend asked me last month whether she could trust the photo calorie counter she had just installed. The answer, like most honest answers, is “mostly, with caveats.” Below is what I told her, expanded slightly so the caveats are clear.

What current models actually do

Photo calorie scanners run two steps. First, recognition: the model identifies what is on the plate. Second, estimation: it guesses portion size from the visual cues — bowl diameter, food density, packaging size. Both steps are calibrated against large databases of real photos with known nutrition values.

The recognition step is genuinely strong now. For common dishes — pasta, salads, most breakfast plates, packaged products — accuracy at the dish-identification level is well into the high nineties for clear photos. Estimation is the weaker link. Portion guessing from a single 2D image is a hard problem, and that is where most of the real-world error comes from.

Where it works well

Single-dish photos taken from above with reasonable lighting. Packaged foods with visible labels. Plain proteins like grilled chicken or salmon. Cereal in a bowl. Anything where the food has an obvious shape and a typical serving.

Where it falls apart

Mixed dishes piled on one plate, casseroles, soups with hidden ingredients, and any cuisine that is regionally specific without enough training data behind it. Tiny portions are also tricky — a teaspoon of olive oil drizzled on a salad might add 120 calories that the camera cannot really see.

What “good enough” looks like

For weight management, you do not need 5% accuracy on every meal. You need consistency. If a photo logger underestimates a particular dish by 10% but does so every time, your trend line is still useful — because trends, not single days, drive results.

The practical rule: trust the daily total within roughly ten to fifteen percent. If your goal is to lose half a kilo a week, that is the resolution that matters.

How to be the human in the loop

Three habits make photo logging dramatically more useful. One: when the model offers two candidate matches, pick the right one. Two: confirm portion if it looks off — most apps let you bump it up or down with a slider. Three: log the things that do not photograph well — oils, sauces, dressings — manually. They are the calories that disappear into the model’s blind spot.

Where Enerium lands

Enerium’s Vision AI sits at the higher end of recognition accuracy in the consumer category, with portion estimation that we deliberately keep transparent — you see the confidence number, you can correct, and your corrections feed back into your personal results. Photo logging is faster than typing, and adherence is what actually moves the needle.