Thanks for the details. So we agree that this approach is not the best. I saw the GitHub issue you just opened, and I'm glad you are brainstorming new solutions.
Btw, speaking of atomicity, I'm also interested in this topic, but applied to text+image contexts. That is another hard problem, but solving it could be really useful for public content (e.g. long-form content).