AI Toolbox
A curated collection of 960 free, cutting-edge AI papers with code and tools for text, image, video, 3D, and audio generation and manipulation.
3DHM can animate people with 3D camera control from a single image and a given target video motion sequence.
MaPa can generate high-quality materials for 3D meshes. It creates segment-wise procedural material graphs as the appearance representation, which supports high-quality rendering and makes the materials easy to edit.
SemLayoutDiff can generate diverse 3D indoor scenes by creating detailed semantic maps and placing furniture while considering doors and windows.
3DV-TON can generate high-quality videos for trying on clothes using 3D models. It handles complex clothing patterns and different body poses well, and it has a strong masking method to reduce errors.
CanonSwap can transfer identities from images to videos while keeping natural movements like head poses and facial expressions.
Hunyuan-GameCraft can generate interactive game videos by mapping keyboard and mouse inputs into a shared camera representation.
Vivid-VR can restore and enhance videos using a text-to-video diffusion transformer. It achieves realistic textures and smooth motion while preserving content and giving users control over the video generation process.
Lumen can replace video backgrounds while adjusting the lighting of the foreground for a consistent look.
OmniTry lets users virtually try on jewelry and accessories without needing a mask.
LongSplat can create high-quality 3D scenes from long videos without needing camera positions.
MyTimeMachine can change faces to look older or younger using a global aging model. It needs just 50 selfies to keep the person’s identity, making it great for visual effects and realistic age transformations.
HOIDiNi can generate realistic human-object interactions with accurate hand contact and natural body movement from text prompts.
SewingLDM can generate complex sewing patterns using text prompts, body shapes, and garment sketches. It allows for detailed customization and significantly improves the design of garments to fit different body types.
AnimateAnyMesh can animate 3D meshes based on text prompts.
PERSONA can create personalized 3D avatars from a single image, allowing for realistic animations that reflect the subject’s identity.
Matrix-3D can generate 3D worlds from a single image or text prompt. It allows users to explore these environments in any direction and supports both quick and detailed scene creation.
FantasyPortrait can generate high-quality animations from static images for both single and multi-character scenes.
MonetGPT can critique photos and suggest retouching edits. It explains each adjustment clearly, helps keep the subject’s identity, and allows for personalized editing plans.
WIR3D can abstract 3D shapes into a sparse set of 3D curves, which makes the shapes easier to edit and deform.
Sketch2Anim can turn 2D storyboard sketches into high-quality 3D animations. It uses a motion generator for precise control and a neural mapper to align 2D sketches with 3D motion, allowing for easy editing and animation control.