[feat] Native-Resolution Image Synthesis by pufanyi · Pull Request #113 · EvolvingLMMs-Lab/lmms-engine

pufanyi · 2025-12-10T02:41:28Z

Currently support c2i

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2025-12-10T02:43:58Z

src/lmms_engine/train/fsdp2/nit_trainer.py

+        label = torch.tensor(batch_label)
+        hw_list = torch.tensor(hw_list, dtype=torch.int32)
+
+        # # Move tensors to model device for FSDP2 compatibility
+        # device = next(self.model.parameters()).device


Move NiT training metadata to the model device

During NitTrainer.compute_loss the labels and hw_list remain on CPU (torch.tensor(...) with no .to(device) after the move block was commented out), while the latents/noise and model live on CUDA. When FlowMatchingLoss calls the NiT model, the label embeddings and rotary cache use these CPU tensors, which will raise a device mismatch as soon as training runs on GPU. Please move label and hw_list to the model device before building model_kwargs.

Useful? React with 👍 / 👎.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2025-12-10T02:44:48Z

src/lmms_engine/train/fsdp2/nit_trainer.py

+        label = torch.tensor(batch_label)
+        hw_list = torch.tensor(hw_list, dtype=torch.int32)
+


Move labels and shapes to model device before forward

The new NiT trainer builds label and hw_list tensors on CPU but then passes them directly into self.loss_fn/self.model while images and latents are on CUDA. NitModel embeds y and computes RoPE grids inside the forward pass, so CPU indices against CUDA buffers will raise a device mismatch as soon as training starts. The commented-out .to(device) calls just below suggest these tensors were intended to be moved. Please transfer label and hw_list to the model device before invoking the model to avoid runtime crashes when running on GPUs.

Useful? React with 👍 / 👎.

kcz358 · 2025-12-17T03:36:26Z

examples/qwen3_vl/example_config.yaml

Wrong config edit?

😱😱😱

kcz358 · 2025-12-17T03:37:15Z

src/lmms_engine/models/nit/nvidia_radio/__init__.py

Is this part nvidia radio files all necessary? Would be nice if we can remove some parts if the modeling nit is not actually using it

ok, I will try it

…ts for NIT.

…o prevent Hydra conflicts.

…bility.

…image.

… calculate token dimensions and return class ID.

…tDataProcessor` to accept a dictionary row.

… token counts in NitDataset.

…tiple sequences into a single batch.

…s and update package dependencies.

…ating type hints in NitDataset and NitProcessor.

…, and modify example configuration for dataset format and output directory.

…r processor type

… packing strategy updates

… directory and logging settings

…le files.

… eliminate unused modeling_utils module

…itTrainer for enhanced model configuration and loss computation

…in prepare_nit function

…ding gradient checkpointing and attention implementation adjustments

…r to nit_trainer

…dp2 trainers

…ainer with additional parameters for improved training dynamics

pufanyi force-pushed the pufanyi/nit branch from 391874b to 70e43ed Compare December 10, 2025 02:43

chatgpt-codex-connector bot reviewed Dec 10, 2025

View reviewed changes

kcz358 reviewed Dec 17, 2025

View reviewed changes

pufanyi added 25 commits December 17, 2025 12:11

feat: Add NVIDIA RADIO model architecture and its supporting componen…

19d3bb4

…ts for NIT.

lint

b04eb6c

feat: Add initial structure and placeholder classes for the NiT model.

81eb889

docs: Add hydra.output_subdir=null to multi-node training example t…

a63549e

…o prevent Hydra conflicts.

docs: Clarify Hydra multi-node training conflict explanation.

0b97936

style: Update quote style in __all__ and add a blank line for reada…

785185c

…bility.

refactor: rename modeling.py to modeling_nit.py

3392323

feat: Add NIT model implementation, configuration, and data processor.

8f933f7

refactor: update NitProcessor.process to accept and return only an …

0352822

…image.

feat: Add VAE configuration parameters and update image processing to…

eaa8cae

… calculate token dimensions and return class ID.

feat: Add NitDataset for loading and processing data and update `Ni…

8368c29

…tDataProcessor` to accept a dictionary row.

refactor: reorder import statement and remove extraneous blank lines.

401df5f

feat: Implement LPFHP algorithm for efficient data packing and expose…

98c943b

… token counts in NitDataset.

feat: Implement data packing functionality in NITDataset to group mul…

25a99bd

…tiple sequences into a single batch.

style: Reorder imports and add a blank line for improved readability.

3d154e4

feat: Integrate VAE loading and encoding into NitModel using diffuser…

985c9a6

…s and update package dependencies.

refactor: Improve code readability by reformatting long lines and upd…

20f9c18

…ating type hints in NitDataset and NitProcessor.

chore: Update dependencies by removing pynvml and adding nvidia-ml-py…

5f2e431

…, and modify example configuration for dataset format and output directory.

chore: Update triton dependencies and modify example configuration fo…

87d7aea

…r processor type

feat: Enhance NitDataset and configuration with processor workers and…

34c435d

… packing strategy updates

feat: Add timm dependency and update example configuration for output…

78b6948

… directory and logging settings

style: Reorder import statements for improved readability in nit modu…

a473f75

…le files.

refactor: Remove VAE loading and encoding functions from NitModel and…

b4997ab

… eliminate unused modeling_utils module

lint

8d6d363

feat: Update NitConfig parameters and integrate FlowMatchingLoss in N…

72403fc

…itTrainer for enhanced model configuration and loss computation

pufanyi added 6 commits December 17, 2025 12:12

feat: Enhance model weight loading by supporting .safetensors format …

4a6d7a2

…in prepare_nit function

lint

6befd32

fix: Update example configuration and model parameters for Nit, inclu…

c52b52d

…ding gradient checkpointing and attention implementation adjustments

chore: Update trainer type in example configuration from fsdp2_traine…

38b6067

…r to nit_trainer

feat: Add NitConfig and NitModel imports to lmms_engine models and fs…

f229c18

…dp2 trainers

feat: Refactor loss computation in FlowMatchingLoss and enhance NitTr…

43a591e

…ainer with additional parameters for improved training dynamics

pufanyi force-pushed the pufanyi/nit branch from 70e43ed to 43a591e Compare December 17, 2025 04:13

pufanyi added 2 commits December 17, 2025 12:14

fix qwen

01bf771

lint

2e1bb5d

Luodian closed this Feb 23, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[feat] Native-Resolution Image Synthesis#113

[feat] Native-Resolution Image Synthesis#113
pufanyi wants to merge 33 commits intomainfrom
pufanyi/nit

pufanyi commented Dec 10, 2025

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Dec 10, 2025

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Dec 10, 2025

Uh oh!

kcz358 Dec 17, 2025

Uh oh!

pufanyi Dec 17, 2025

Uh oh!

kcz358 Dec 17, 2025 •

edited

Loading

Uh oh!

pufanyi Dec 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		label = torch.tensor(batch_label)
		hw_list = torch.tensor(hw_list, dtype=torch.int32)

Conversation

pufanyi commented Dec 10, 2025

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

kcz358 Dec 17, 2025

Choose a reason for hiding this comment

Uh oh!

pufanyi Dec 17, 2025

Choose a reason for hiding this comment

Uh oh!

kcz358 Dec 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

pufanyi Dec 17, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

kcz358 Dec 17, 2025 •

edited

Loading