That's the ultimate purpose of disinformation -- it's not to make you believe false things, it's to make you believe nothing.
So yes, AI fakery will contribute to that phenomenon on behalf of numerous bad actors, but it was always going to happen anyway. You don't need Hinton and Sutskever on your side if you have Ailes and Murdoch.
That's like saying: "Yes, crime might increase, but we will always have crime anyway." What will happen anyway is irrelevant precisely because it happens anyway. What's relevant is the expected increase in media distrust once everything might be a fake.
From https://arxiv.org/html/2409.11340v1
> Unlike popular diffusion models, OmniGen features a very concise structure, comprising only two main components: a VAE and a transformer model, without any additional encoders.
> OmniGen supports arbitrarily interleaved text and image inputs as conditions to guide image generation, rather than text-only or image-only conditions.
> Additionally, we incorporate several classic computer vision tasks such as human pose estimation, edge detection, and image deblurring, thereby extending the model’s capability boundaries and enhancing its proficiency in complex image generation tasks.
This enables prompts for edits like: "|image_1| Put a smile face on the note." or "The canny edge of the generated picture should look like: |image_1|"
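For anyone wondering how a prompt like that might be fed to a model, here is a minimal sketch of splitting an interleaved prompt into ordered text and image segments. It assumes the `|image_N|` notation used in the examples above; `split_interleaved` and `PLACEHOLDER` are hypothetical names, not OmniGen's actual API, and the real tokenizer may use a different placeholder syntax.

```python
import re

# Hypothetical placeholder syntax, taken from the example prompts above.
PLACEHOLDER = re.compile(r"\|image_(\d+)\|")


def split_interleaved(prompt, images):
    """Split a prompt into ordered ("text", str) and ("image", obj) segments.

    `images` is a list; |image_1| refers to images[0], |image_2| to images[1], etc.
    """
    segments = []
    pos = 0
    for match in PLACEHOLDER.finditer(prompt):
        # Keep any text that precedes this placeholder.
        if match.start() > pos:
            segments.append(("text", prompt[pos:match.start()]))
        index = int(match.group(1))
        if not 1 <= index <= len(images):
            raise ValueError(
                f"prompt references image_{index}, "
                f"but only {len(images)} image(s) were given"
            )
        segments.append(("image", images[index - 1]))
        pos = match.end()
    # Keep any trailing text after the last placeholder.
    if pos < len(prompt):
        segments.append(("text", prompt[pos:]))
    return segments


segments = split_interleaved(
    "|image_1| Put a smile face on the note.", ["note.png"]
)
```

The point is only that text and images end up in one ordered sequence, which is what "arbitrarily interleaved" conditioning needs; how each segment is then embedded is up to the model.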
> To train a robust unified model, we construct the first large-scale unified image generation dataset X2I, which unifies various tasks into one format.
Not exactly. They mention starting with the VAE from Stable Diffusion XL and the Transformer from Phi-3.
Looks like these LLMs can really be used for anything
Took a few minutes to load, some assets download at less than 1kbps. The first 3 times I got a "Connection error" after 30s. The 4th time has now been running for 5m.
It's pretty impressive so far. Image quality isn't mind-blowing, but the multi-modal aspects are almost disturbingly powerful.
Not a lot of guardrails, either.
Or, if you need solid regions that overlap and mask out other regions, then generate objects over a chroma-keyable flat background.
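A rough sketch of the chroma-key step, assuming the object was generated over a pure green background. It is plain Python over nested lists of RGB tuples to stay dependency-free; the `chroma_key` name, the key color, and the tolerance are all illustrative assumptions, and a real pipeline would likely use an image library and soften the edges.

```python
def chroma_key(pixels, key=(0, 255, 0), tolerance=60):
    """Convert rows of (r, g, b) pixels to rows of (r, g, b, a) pixels.

    Pixels whose color lies within `tolerance` (Euclidean distance in RGB)
    of `key` become fully transparent; everything else stays opaque.
    """
    tol_sq = tolerance * tolerance
    out = []
    for row in pixels:
        new_row = []
        for r, g, b in row:
            dist_sq = (r - key[0]) ** 2 + (g - key[1]) ** 2 + (b - key[2]) ** 2
            alpha = 0 if dist_sq <= tol_sq else 255
            new_row.append((r, g, b, alpha))
        out.append(new_row)
    return out


# Tiny 2x2 "image": two near-green background pixels, two object pixels.
image = [
    [(0, 255, 0), (200, 30, 30)],
    [(10, 250, 5), (0, 0, 0)],
]
layer = chroma_key(image)
```

A hard threshold like this leaves a colored fringe on anti-aliased edges, which is the main reason a method that generates transparency directly can look cleaner.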
Transparent Image Layer Diffusion using Latent Transparency
While this will enable somewhat more spam, it will, more importantly, democratize the creative process for those who want to tell a story in images but lack the skill and resources to produce it traditionally.
Check out:
It is expensive though: Flux dev images are about $0.035/image
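To put that per-image figure in perspective, a quick back-of-the-envelope sketch. The $0.035 price is the rough number quoted above; the batch sizes and the `batch_cost` helper are just illustrative assumptions.

```python
from decimal import Decimal

# Rough per-image price in USD, as quoted above (not an official rate).
PRICE_PER_IMAGE = Decimal("0.035")


def batch_cost(num_images, price=PRICE_PER_IMAGE):
    """Return the total USD cost of generating `num_images` images."""
    if num_images < 0:
        raise ValueError("num_images must be non-negative")
    return price * num_images


# Hypothetical batch sizes, purely for illustration.
for n in (100, 1_000, 10_000):
    print(f"{n:>6} images: ${batch_cost(n):.2f}")
```

`Decimal` is used instead of floats so the totals do not pick up binary rounding noise.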