Voxel Representation Model

Installation

conda create -y --prefix $(pwd)/.conda python=3.10
conda activate $(pwd)/.conda
pip install torch torchvision torchaudio
pip install git+https://github.com/huggingface/[email protected]
pip install accelerate python-dotenv matplotlib trimesh scipy omegaconf vllm
conda install -y -c conda-forge libstdcxx-ng

For Training

git clone https://github.com/hiyouga/LLaMA-Factory.git
cd LLaMA-Factory
git checkout 42e090d38b986bbf989055851209f91565b69e89
pip install -e ".[torch,metrics]"
cd ..
pip install deepspeed liger-kernel

Download Test set

git clone https://huggingface.co/datasets/homebrewltd/voxel-representation downloaded_dataset
cp -r downloaded_dataset/test output/test 
cp -r downloaded_dataset/test_100 output/test_100

Getting Started

Run Inference for 100 examples

export HF_HOME=$(pwd)/.hf_home
CUDA_VISIBLE_DEVICES=0 python inference_vllm.py --checkpoint-path homebrewltd/voxel-representation-gemma3-4b --input-dir output/test_100/image2d --output-filename output/test_100/model_prediction.jsonl

Visualize

Labels

python data_visualization.py --data-path output/test_100 --example-id <id>

id can be 0-99

Model Prediction

python data_visualization.py --data-path output/test_100 --labels-file output/test_100/model_prediction.jsonl --example-id <id>

id can be 0-99

Getting Eval Result

Run Inference for test set

export HF_HOME=$(pwd)/.hf_home
CUDA_VISIBLE_DEVICES=0 python inference_vllm.py --checkpoint-path homebrewltd/voxel-representation-gemma3-4b --input-dir output/test/image2d --output-filename output/test/model_prediction.jsonl

Run Eval

python data_eval.py --label output/test/labels.jsonl  --output output/test/model_prediction.jsonl

Training

Data Preparation

Download ModelNet 40 Dataset

wget http://modelnet.cs.princeton.edu/ModelNet40.zip
unzip ModelNet40.zip

Generate Synthesis data

# edit your path to model net in  config/data_train.yaml 
cp config/data_train.example.yaml config/data_train.yaml 
python data_generation.py

Convert Synthesis data to LLaMA-Factory format

python process_data.py output/train

Training

export HF_HOME=$(pwd)/.hf_home
DISABLE_VERSION_CHECK=1 llamafactory-cli  train ./config/training_gemma3_pt_visual.yaml

Push to hub

python push_to_hub.py <username> <checkpoint_number> saves/gemma3-4b-pt/full/sft/checkpoint-1100

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Voxel Representation Model

Installation

For Training

Download Test set

Getting Started

Run Inference for 100 examples

Visualize

Labels

Model Prediction

Getting Eval Result

Run Inference for test set

Run Eval

Training

Data Preparation

Download ModelNet 40 Dataset

Generate Synthesis data

Convert Synthesis data to LLaMA-Factory format

Training

Push to hub

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
config		config
data		data
output		output
src		src
.gitignore		.gitignore
README.md		README.md
data_eval.py		data_eval.py
data_generation.py		data_generation.py
data_visualization.py		data_visualization.py
inference_vllm.py		inference_vllm.py
process_data.py		process_data.py
push_to_hub.py		push_to_hub.py
result.gif		result.gif

menloresearch/voxel-representation

Folders and files

Latest commit

History

Repository files navigation

Voxel Representation Model

Installation

For Training

Download Test set

Getting Started

Run Inference for 100 examples

Visualize

Labels

Model Prediction

Getting Eval Result

Run Inference for test set

Run Eval

Training

Data Preparation

Download ModelNet 40 Dataset

Generate Synthesis data

Convert Synthesis data to LLaMA-Factory format

Training

Push to hub

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages