MagicScroll

Generation in Different Aspect Ratios

By providing control over style, concept, and layout at the foreground, midground, and background levels, our framework supports visual storytelling content generation across a variety of scenarios and aspect ratios.

Chinese painting (left): "In a deep mountain enclave, where pines and cypresses thrive, Amidst swirling clouds and mist, distant rocks and peaks arrive. A dwelling stands in solitude, with a simple architectural grace, Beyond bamboo fences, children play in a joyous embrace. Beneath the trees, two figures converse with laughter and delight, Together forming a scene of tranquil seclusion, warm and bright."

Chinese painting (right): "In deep mountains, a waterfall descends from great height, Winding its way down, into a crystal pool, pure and bright. Water droplets dance, a splash of liquid light, In the distance, evergreen pines, in the foreground, rocks stand upright. The springwater murmurs, weaving a melody with stones, Surrounded by bushes, a charming scene it owns."

Cinematic panorama (top): "Amidst the sprawling castles, a troop of cavalry embarks on a quest. They traverse distant mountains, weaving through dense forests, passing rivers, castles, and perilous peaks. On the other end awaits a colossal dragon and its minions, their forms menacing, teeth bared, claws ready—intent on slaying all who dare to invade."

Cinematic panorama (bottom): "Captain David and First Mate Jess set sail for a distant voyage. Their ship gazes upon far-off peaks, navigating through thick veils of mist, braving treacherous reefs. David and Jess stand boldly at the bow, unfolding a majestic scene like a grand painting coming to life."

Comic strip (upper middle): "Early in the morning, the hero Catherine received a call to save the world. Armed and ready, she joined forces with Jack and headed towards the heart of the city. Gazing at the distant smoke, Jack's expression turned solemn as he thought of the lives lost. Despite the gravity of the situation, they bravely rescued the city, fulfilling their mission. The duo stood victorious, having overcome the challenges in their path."

Comic strip (lower middle): "In a fairy tale, there exists a village of toys, adorned with beautiful castles, lakes, and flowers. People picnic in the fields, row boats on the lake, and celebrate harvest days in the snowy season. In this village, docks, fountains, and flower houses are constructed to embellish the tranquil life of its inhabitants."

From left to right: “In a serene garden, lakes and waterfalls flow gently as two girls, dressed in long skirts, run through it. The flowing water travels through dense forests, reaching vast meadows covered with green trees. Distant mountain peaks come into view, and amidst the ever-changing clouds, the mountains and waters harmonize. In this fantastical world, we gradually witness a series of towering castles standing in the distant lakeside, narrating an ancient story under the blue sky.”

MagicScroll: Enhancing Immersive Storytelling with Controllable Scroll Image Generation

Abstract

Method

A framework for generating images with nontypical aspect ratios from storytelling text, with style and layout controls.

Results

Qualitative Comparison of Our Method with Other Baselines

Generation in Different Aspect Ratios

More Results Generated by MagicScroll

Videos Synthesized from MagicScroll Outputs
