Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

humaneval #1417

Open
codeAlwaysPass opened this issue Jul 27, 2024 · 6 comments
Open

humaneval #1417

codeAlwaysPass opened this issue Jul 27, 2024 · 6 comments
Assignees
Labels
Milestone

Comments

@codeAlwaysPass
Copy link

请问可以提供测试humaneval的相关方法吗,现在想要进行测试但是完全不知道该如何下手,generate_repo返回的是一个Projectrepo类型不能被转换成jsonl,希望作者能加上去,谢谢。

@iorisa
Copy link
Collaborator

iorisa commented Jul 29, 2024

ProjectRepo对应的是一个文件夹路径。如果要转JSON,本质上是将一个文件夹转JSON。
你希望转成什么样的JSON?
@codeAlwaysPass

@codeAlwaysPass
Copy link
Author

我希望转成一个可以让humaneval评估的jsonl文件,目前我是自己写python脚本把文件夹里面的代码写入jsonl,但可能是由于我的代码问题这样得到的jsonl获得的准确率只有0.27。

@stellaHSR
Copy link
Collaborator

由于版本迭代新增功能且迭代较快,engineer 中增加了不少 repo 级别的处理和设计。针对 HumanEval,我建议你先熟悉并使用 v0.4-release 或者之前的版本。在这个版本中,你可以获取 engineer 的执行结果(msg = Message(content=code, role=self.profile, cause_by=WriteCode))。你可以选择在 msg 生成之前直接将code保存到JSONL文件,或者在 engineer 执行结束后,通过 memory 的 get_by_actions 函数来获取代码,再将其写入 JSONL 文件。:)

@geekan
Copy link
Owner

geekan commented Oct 20, 2024

There will be some scripts related to humaneval in version 1.0

@geekan geekan closed this as completed Oct 20, 2024
@geekan geekan reopened this Oct 20, 2024
@geekan geekan added this to the 1.0 milestone Oct 20, 2024
@wkwk-ai
Copy link

wkwk-ai commented Dec 17, 2024

请问现在有可以测试humaneval的script了吗?

Copy link

This issue has no activity in the past 30 days. Please comment on the issue if you have anything to add.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

5 participants