
Apply updates from the remote repo's main branch #8

Merged
merged 11 commits into main from update/20250228_sync_fork
Mar 4, 2025

Conversation

hancheolcho
Collaborator

@hancheolcho hancheolcho commented Feb 28, 2025

Description

This applies the updates that have landed on the main branch of https://github.com/EvolvingLMMs-Lab/lmms-eval.

None of this touches DocVision, so it should be fine to give it one pass and approve.

kcz358 and others added 11 commits February 14, 2025 08:51
* add charades_sta for temporal grounding

* update

* update formatting

---------

Co-authored-by: john.zhang <[email protected]>
* Update README.md

* Update README.md

* [Fix] of "mvbench missing videos"

Modify DATA_LIST so that it can find the corresponding video

* [Fix] of "evaluation of llava_vid on mvbench"

"mvbench_video/tvqa/frames_fps3_hq/castle_s07e04_seg02_clip_14" is the sampled video frame, not the original video file. And The current code logic cannot handle this subtask.

* Fixing lmms_eval/models/llava_vid.py
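For context on the mvbench fix above: the idea is to branch on whether the visual path is a frame directory or a real video file. A minimal sketch of that pattern, assuming hypothetical names (this is not the actual llava_vid.py code; decord is commonly used in lmms-eval video models, but its use here is an assumption):

import os
from PIL import Image

def load_visual(path, max_frames=32):
    if os.path.isdir(path):
        # The path points at a directory of pre-sampled frames, not a video file.
        frame_files = sorted(os.listdir(path))[:max_frames]
        return [Image.open(os.path.join(path, f)).convert("RGB") for f in frame_files]
    # Otherwise decode the video file itself and sample frames uniformly.
    from decord import VideoReader
    vr = VideoReader(path)
    step = max(1, len(vr) // max_frames)
    indices = list(range(0, len(vr), step))[:max_frames]
    frames = vr.get_batch(indices).asnumpy()  # (N, H, W, C) uint8 array
    return [Image.fromarray(f) for f in frames]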
* Add VLLM model integration and update configurations

- Introduce VLLM model in the model registry.
- Update AVAILABLE_MODELS to include new models:
  - models/__init__.py: Added "aria", "internvideo2", "llama_vision", "oryx", "ross", "slime", "videochat2", "vllm", "xcomposer2_4KHD", "xcomposer2d5".
- Create vllm.py for VLLM model implementation:
  - Implemented encoding for images and videos.
  - Added methods for generating responses and handling multi-round generation.
- Update mmmu tasks with new prompt formats and evaluation metrics:
  - mmmu_val.yaml: Added specific kwargs for prompt types.
  - mmmu_val_reasoning.yaml: Enhanced prompts for reasoning tasks.
  - utils.py: Adjusted evaluation rules and scoring for predictions.
- Added script for easy model execution:
  - vllm_qwen2vl.sh: Script to run VLLM with specified parameters.
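As a rough sketch of what the image/video encoding described above typically looks like for an OpenAI-style vLLM chat request (assumed shape, not the exact vllm.py code):

import base64
from io import BytesIO
from PIL import Image

def encode_image(img: Image.Image) -> str:
    # Serialize the PIL image and base64-encode it for embedding in a message.
    buf = BytesIO()
    img.convert("RGB").save(buf, format="JPEG")
    return base64.b64encode(buf.getvalue()).decode("utf-8")

def to_image_content(img: Image.Image) -> dict:
    # OpenAI-style chat messages carry images as image_url entries with data URIs.
    return {
        "type": "image_url",
        "image_url": {"url": f"data:image/jpeg;base64,{encode_image(img)}"},
    }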

* Set environment variables for VLLM script

- Configure environment for better performance and debugging.
- Added variables to control multiprocessing and NCCL behavior.

miscs/vllm_qwen2vl.sh:
- Set `VLLM_WORKER_MULTIPROC_METHOD` to `spawn` for compatibility.
- Enabled `NCCL_BLOCKING_WAIT` to avoid hangs.
- Increased `NCCL_TIMEOUT` to 18000000 for long-running processes.
- Set `NCCL_DEBUG` to `DEBUG` for detailed logs.

* Rename scripts and update paths

- Renamed representation scripts for clarity.
  - miscs/repr_scripts.sh -> miscs/model_dryruns/llava_1_5.sh
  - miscs/cicd_qwen2vl.sh -> miscs/model_dryruns/qwen2vl.sh
  - miscs/tinyllava_repr_scripts.sh -> miscs/model_dryruns/tinyllava.sh
  - miscs/vllm_qwen2vl.sh -> miscs/model_dryruns/vllm_qwen2vl.sh
- Updated parameters in the vllm_qwen2vl.sh script.
  - miscs/model_dryruns/vllm_qwen2vl.sh: Added `--limit=64` to output path command.

* Optimize image handling in VLLM model

- Simplify image conversion in the `to_base64` method:
  - vllm.py: Directly convert input image to RGB format instead of copying it.
- Remove unnecessary base64 encoding for images:
  - vllm.py: Return the PIL image directly instead of converting it to base64.
- Update video frame processing to return PIL images:
  - vllm.py: Replace base64 encoding of frames with returning the PIL frames directly.

* Revert "Optimize image handling in VLLM model"

This reverts commit 469e1fc.

* use threads to encode visuals

---------

Co-authored-by: kcz358 <[email protected]>
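The "use threads to encode visuals" commit above boils down to a standard thread-pool pattern. A minimal sketch, assuming an encode_image(img) -> str helper like the one sketched earlier (the actual vllm.py code may differ):

from concurrent.futures import ThreadPoolExecutor

def encode_visuals(images, threads=8):
    # Each image is encoded independently, so a simple thread pool
    # parallelizes the work without multiprocessing overhead.
    with ThreadPoolExecutor(max_workers=threads) as pool:
        return list(pool.map(encode_image, images))  # assumes encode_image from the earlier sketch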
* Update API usage and add environment configuration

- Adjust API client initialization for OpenAI and Azure.
- Modify image encoding to handle size limits and maintain aspect ratio.
- Add new `OpenAICompatible` model class.
- Introduce retry mechanism for API requests with configurable retries.
- Update `.gitignore` to exclude `.env` and `scripts/`.

.gitignore:
- Exclude `.env` file for security.
- Ensure no scripts directory is tracked.

lmms_eval/models/gpt4v.py:
- Refactor API client initialization to use new OpenAI and Azure clients.
- Update image encoding to handle size limits with resizing logic.
- Adjust retry logic for API calls, reducing sleep time.

lmms_eval/models/openai_compatible.py:
- Create new `OpenAICompatible` model class with similar structure.
- Implement encoding functions for images and videos.
- Integrate environment variable loading and persistent response caching.

miscs/model_dryruns/openai_compatible.sh:
- Add sample script for running the new model.
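A sketch of the configurable retry mechanism described above, using the official openai>=1.x client; parameter names such as max_retries are assumptions, not the exact openai_compatible.py implementation:

import time
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment (e.g. a loaded .env)

def request_with_retries(messages, model="gpt-4o", max_retries=5, sleep_s=2):
    for attempt in range(max_retries):
        try:
            return client.chat.completions.create(model=model, messages=messages)
        except Exception:
            if attempt == max_retries - 1:
                raise  # out of retries; surface the error to the caller
            time.sleep(sleep_s)  # short pause before the next attempt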

* Improve code readability and organization

- Remove unused import for deepcopy in `openai_compatible.py`.
- Add a blank line for better separation of code sections.
- Adjust comment formatting for `max_size_in_mb` for consistency.
- Ensure consistent spacing around comments.

File: `lmms_eval/models/openai_compatible.py`
- Removed `deepcopy` import: cleaned up unnecessary code.
- Added blank line after `load_dotenv`: improved readability.
- Reformatted comment on `max_size_in_mb`: enhanced clarity.
- Removed extra blank line before `Accelerator`: tightened spacing.

* Fix init

---------

Co-authored-by: kcz358 <[email protected]>
* [Model] add Ola audio

* [Model] add Ola

* [Model] Ola

* [Model] add Ola

* [Model] add Ola

* [Model] add Ola

* [Model] add Ola

* Direct import for moviepy

---------

Co-authored-by: kcz358 <[email protected]>
* Update utils.py

* Update utils.py

* Update utils.py
…nk (EvolvingLMMs-Lab#560)

* upload egogpt model

* support egoplan benchmark for lmms-eval

* support egothink benchmark for lmms-eval
@hancheolcho hancheolcho added the enhancement New feature or request label Feb 28, 2025
@hancheolcho hancheolcho self-assigned this Feb 28, 2025

@tkdcjf159 tkdcjf159 left a comment


I've gone through the changes. Approving.


Confirmed the newly added models.


# If image is too large, resize it while maintaining aspect ratio
while len(byte_data) > max_size and img.size[0] > 100 and img.size[1] > 100:
new_size = (int(img.size[0] * 0.75), int(img.size[1] * 0.75))


So it resizes to 75% of the original. Some images will still be huge even after resizing, haha. Noting this and moving on.
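For context, a completed sketch of the resize loop quoted above; the re-encode step and the surrounding function are my reconstruction of what must follow, not copied from gpt4v.py, and the 20 MB default is an assumption:

import base64
from io import BytesIO

def encode_within_limit(img, max_size_in_mb=20):
    max_size = max_size_in_mb * 1024 * 1024
    buf = BytesIO()
    img.save(buf, format="PNG")
    byte_data = buf.getvalue()
    # If image is too large, resize it while maintaining aspect ratio
    while len(byte_data) > max_size and img.size[0] > 100 and img.size[1] > 100:
        new_size = (int(img.size[0] * 0.75), int(img.size[1] * 0.75))
        img = img.resize(new_size)
        buf = BytesIO()
        img.save(buf, format="PNG")  # re-encode the shrunken image
        byte_data = buf.getvalue()
    return base64.b64encode(byte_data).decode("utf-8")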

self.threads = threads

accelerator = Accelerator()
self.client = LLM(


If we get this running on vllm, we can use it from lmms-eval.

hancheolcho (Collaborator, Author)


Yes, an eval run currently takes 40-50 minutes, so I'm hoping this makes it a bit faster.
Seems essential, haha.
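A sketch of what the truncated `self.client = LLM(` initialization quoted above likely expands to, using vLLM's real LLM/SamplingParams entry points; the model name and tensor_parallel_size are assumptions for illustration:

from vllm import LLM, SamplingParams

llm = LLM(model="Qwen/Qwen2-VL-7B-Instruct", tensor_parallel_size=1)
params = SamplingParams(temperature=0, max_tokens=512)
outputs = llm.chat(
    [{"role": "user", "content": "Describe the scene in one sentence."}],
    sampling_params=params,
)
print(outputs[0].outputs[0].text)  # first completion of the first request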

@@ -16,11 +26,15 @@ metric_list:
lmms_eval_specific_kwargs:
default:
prompt_type: "reasoning"
multiple_choice_prompt: "Please show step-by-step reasoning, and answer the question with option letter from given choices."
open_ended_prompt: "Please show step-by-step reasoning, and answer the question using a single word or phrase."
multiple_choice_prompt: "Please reason step by step, and answer the question with option letter from given choices in the format of Answer: <option letter>."


Confirmed the mmmu reasoning prompt.

export NCCL_TIMEOUT=18000000
export NCCL_DEBUG=DEBUG

python3 -m lmms_eval \


Confirmed the command for running evaluation with vllm.

@hancheolcho hancheolcho merged commit b3648c3 into main Mar 4, 2025
0 of 2 checks passed
@hancheolcho hancheolcho deleted the update/20250228_sync_fork branch March 4, 2025 23:07