Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

检索不到文献 #1

Open
Roger-GOAT opened this issue Feb 28, 2025 · 4 comments
Open

检索不到文献 #1

Roger-GOAT opened this issue Feb 28, 2025 · 4 comments

Comments

@Roger-GOAT
Copy link

Roger-GOAT commented Feb 28, 2025

你好~~,修改检索策略后只使用单个最简单的检索词都不能检索到文献,请问是怎么回事呢

Traceback (most recent call last):
File "C:\Users\xxxxxxxxxx\nnscholar-search\nnscholar-search-main\analyze_papers.py", line 70, in analyze_papers
export_to_text(df, text_file)
File "C:\Users\xxxxxxxxxx\nnscholar-search\nnscholar-search-main\analyze_papers.py", line 38, in export_to_text
df['相关度数值'] = df['相关度'].apply(lambda x: float(str(x).rstrip('%')) if pd.notnull(x) else 0)
File "C:\Users\xxxxxxxxxx\miniconda3\envs\research\lib\site-packages\pandas\core\frame.py", line 4090, in getitem
indexer = self.columns.get_loc(key)
File "C:\Users\xxxxxxxxxx\miniconda3\envs\research\lib\site-packages\pandas\core\indexes\range.py", line 417, in get_loc
raise KeyError(key)
KeyError: '相关度'
2025-02-28 09:27:20,341 - INFO - 筛选后分析完成,生成报告:None,文献清单:None
2025-02-28 09:27:20,341 - INFO - 127.0.0.1 - - [28/Feb/2025 09:27:20] "POST /api/search HTTP/1.1" 200 -

@luckylykkk
Copy link
Owner

luckylykkk commented Feb 28, 2025 via email

@Roger-GOAT
Copy link
Author

我之前没有用过,刚刚部署的。这是启动后的信息
(research) PS C:\Users\xxxxxxxx\nnscholar-search\nnscholar-search-main> python app.py
C:\Users\xxxxxxxxxxxx\nnscholar-search\nnscholar-search-main\app.py:15: DeprecationWarning:
Pyarrow will become a required dependency of pandas in the next major release of pandas (pandas 3.0),
(to allow more performant data types, such as the Arrow string type, and better interoperability with other libraries)
but was not found to be installed on your system.
If this would cause problems for you,
please provide us feedback at pandas-dev/pandas#54466

import pandas as pd
2025-02-28 15:35:30,109 - INFO - 成功加载 225 个停用词
2025-02-28 15:35:30,187 - INFO - 找到.env文件: C:\Users\xxxxxxxxxxxx\nnscholar-search\nnscholar-search-main.env
2025-02-28 15:35:30,187 - INFO - 环境文件内容预览 (前100字符): # DeepSeek API配置
DEEPSEEK_API_KEY=sk-cf1e5a4c91674b0e993050a2044ff9ef

PubMed API配置

PUBMED_API_KEY...
2025-02-28 15:35:30,187 - INFO - 环境变量读取结果:
2025-02-28 15:35:30,187 - INFO - DEEPSEEK_API_KEY: sk-c...f9ef
2025-02-28 15:35:30,187 - INFO - PUBMED_API_KEY: ea55...0c08
2025-02-28 15:35:30,187 - INFO - PUBMED_EMAIL: [email protected]
2025-02-28 15:35:30,187 - INFO - TOOL_NAME: nnscholar_pubmed
2025-02-28 15:35:30,187 - INFO - PUBMED_API_URL: https://eutils.ncbi.nlm.nih.gov/entrez/eutils/
2025-02-28 15:35:30,203 - INFO - 开始加载期刊数据文件: C:\Users\xxxxxxxxxxnnscholar-search\nnscholar-search-main\data\journal_metrics\jcr_cas_ifqb.json
2025-02-28 15:35:30,313 - INFO - 成功加载期刊数据,包含 20034 条记录
2025-02-28 15:35:30,313 - INFO - 原始数据示例: [{"journal": "CA-A CANCER JOURNAL FOR CLINICIANS", "jabb": "CA-CANCER J CLIN", "issn": "0007-9235", "eissn": "1542-4863", "IF": "503.1", "Q": "Q1", "B": "B1", "T": "0"}, {"journal": "NATURE REVIEWS DRUG DISCOVERY", "jabb": "NAT REV DRUG DISCOV", "issn": "1474-1776", "eissn": "1474-1784", "IF": "122.7", "Q": "Q1", "B": "B1", "T": "1"}, {"journal": "LANCET", "jabb": "LANCET", "issn": "0140-6736", "eissn": "1474-547X", "IF": "98.4", "Q": "Q1", "B": "B1", "T": "1"}]
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for Journal of Modern Chinese History
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for Frontiers of History in China
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for Feministische Studien
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for Frontiers of Philosophy in China
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for Souls
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for Scandia
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for Frontiers of Literary Studies in China
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for Tsinghua China Law Review
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for Journal for the Study of Christian Culture
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for Revue Roumaine de Linguistique-Romanian Review of Linguistics
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for Journal of Cellular Automata
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for Kleintierpraxis
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for Philippine Journal of Crop Science
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for Cadmo
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for Informacao & Sociedade-Estudos
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for REVUE D ETUDES COMPARATIVES EST-OUEST
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for Zeitschrift fur Arznei- & Gewurzpflanzen
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for AEROSPACE AMERICA
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for INTERNATIONAL JOURNAL OF HUMAN GENETICS
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for AATCC REVIEW
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for Correspondances en Metabolismes Hormones Diabetes et Nutrition
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for MANUFACTURING ENGINEERING
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for NATURAL HISTORY
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for SECURITIES REGULATION LAW JOURNAL
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for SURFACE COATINGS INTERNATIONAL
2025-02-28 15:35:30,360 - WARNING - 无效的影响因子值: <0.1 for WOCHENBLATT FUR PAPIERFABRIKATION
2025-02-28 15:35:30,360 - INFO - 成功加载 35523 条期刊数据
2025-02-28 15:35:30,360 - INFO - 转换后的数据示例: {"00079235": {"title": "CA-A CANCER JOURNAL FOR CLINICIANS", "if": 503.1, "jcr_quartile": "Q1", "cas_quartile": "B1"}, "15424863": {"title": "CA-A CANCER JOURNAL FOR CLINICIANS", "if": 503.1, "jcr_quartile": "Q1", "cas_quartile": "B1"}, "14741776": {"title": "NATURE REVIEWS DRUG DISCOVERY", "if": 122.7, "jcr_quartile": "Q1", "cas_quartile": "B1"}}
2025-02-28 15:35:30,360 - INFO - 开始加载影响因子趋势数据: C:\Users\xxxxxxxxxx\nnscholar-search\nnscholar-search-main\data\journal_metrics\5year.json
2025-02-28 15:35:30,752 - INFO - 成功加载影响因子趋势数据,包含 21716 条记录
2025-02-28 15:35:30,752 - INFO - 首次运行,开始下载NLTK数据...
2025-02-28 15:35:32,250 - INFO - NLTK数据下载完成
2025-02-28 15:35:32,250 - INFO - 正在启动应用服务器...

  • Serving Flask app 'app'
  • Debug mode: on
    2025-02-28 15:35:32,268 - INFO - WARNING: This is a development server. Do not use it in a production deployment. Use a production WSGI server instead.
  • Running on all addresses (0.0.0.0)
  • Running on http://127.0.0.1:5000
  • Running on http://192.168.2.9:5000
    2025-02-28 15:35:32,268 - INFO - Press CTRL+C to quit
    2025-02-28 15:35:32,268 - INFO - * Restarting with stat
    C:\Users\dengzhen\nnscholar-search\nnscholar-search-main\app.py:15: DeprecationWarning:
    Pyarrow will become a required dependency of pandas in the next major release of pandas (pandas 3.0),
    (to allow more performant data types, such as the Arrow string type, and better interoperability with other libraries)
    but was not found to be installed on your system.
    If this would cause problems for you,
    please provide us feedback at FEEDBACK: PyArrow as a required dependency and PyArrow backed strings pandas-dev/pandas#54466

import pandas as pd
2025-02-28 15:35:35,215 - INFO - 成功加载 225 个停用词
2025-02-28 15:35:35,283 - INFO - 找到.env文件: C:\Users\xxxxxxxxxx\nnscholar-search\nnscholar-search-main.env
2025-02-28 15:35:35,283 - INFO - 环境文件内容预览 (前100字符): # DeepSeek API配置
DEEPSEEK_API_KEY=sk-cf1e5a4c91674b0e993050a2044ff9ef

PubMed API配置

PUBMED_API_KEY...
2025-02-28 15:35:35,283 - INFO - 环境变量读取结果:
2025-02-28 15:35:35,283 - INFO - DEEPSEEK_API_KEY: sk-c...f9ef
2025-02-28 15:35:35,283 - INFO - PUBMED_API_KEY: ea55...0c08
2025-02-28 15:35:35,283 - INFO - PUBMED_EMAIL: [email protected]
2025-02-28 15:35:35,283 - INFO - TOOL_NAME: nnscholar_pubmed
2025-02-28 15:35:35,283 - INFO - PUBMED_API_URL: https://eutils.ncbi.nlm.nih.gov/entrez/eutils/
2025-02-28 15:35:35,283 - INFO - 开始加载期刊数据文件: C:\Users\xxxxxxxxxx\nnscholar-search\nnscholar-search-main\data\journal_metrics\jcr_cas_ifqb.json
2025-02-28 15:35:35,402 - INFO - 成功加载期刊数据,包含 20034 条记录
2025-02-28 15:35:35,402 - INFO - 原始数据示例: [{"journal": "CA-A CANCER JOURNAL FOR CLINICIANS", "jabb": "CA-CANCER J CLIN", "issn": "0007-9235", "eissn": "1542-4863", "IF": "503.1", "Q": "Q1", "B": "B1", "T": "0"}, {"journal": "NATURE REVIEWS DRUG DISCOVERY", "jabb": "NAT REV DRUG DISCOV", "issn": "1474-1776", "eissn": "1474-1784", "IF": "122.7", "Q": "Q1", "B": "B1", "T": "1"}, {"journal": "LANCET", "jabb": "LANCET", "issn": "0140-6736", "eissn": "1474-547X", "IF": "98.4", "Q": "Q1", "B": "B1", "T": "1"}]
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for Journal of Modern Chinese History
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for Frontiers of History in China
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for Feministische Studien
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for Frontiers of Philosophy in China
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for Souls
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for Scandia
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for Frontiers of Literary Studies in China
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for Tsinghua China Law Review
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for Journal for the Study of Christian Culture
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for Revue Roumaine de Linguistique-Romanian Review of Linguistics
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for Journal of Cellular Automata
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for Kleintierpraxis
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for Philippine Journal of Crop Science
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for Cadmo
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for Informacao & Sociedade-Estudos
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for REVUE D ETUDES COMPARATIVES EST-OUEST
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for Zeitschrift fur Arznei- & Gewurzpflanzen
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for AEROSPACE AMERICA
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for INTERNATIONAL JOURNAL OF HUMAN GENETICS
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for AATCC REVIEW
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for Correspondances en Metabolismes Hormones Diabetes et Nutrition
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for MANUFACTURING ENGINEERING
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for NATURAL HISTORY
2025-02-28 15:35:35,434 - WARNING - 无效的影响因子值: <0.1 for SECURITIES REGULATION LAW JOURNAL
2025-02-28 15:35:35,440 - WARNING - 无效的影响因子值: <0.1 for SURFACE COATINGS INTERNATIONAL
2025-02-28 15:35:35,440 - WARNING - 无效的影响因子值: <0.1 for WOCHENBLATT FUR PAPIERFABRIKATION
2025-02-28 15:35:35,440 - INFO - 成功加载 35523 条期刊数据
2025-02-28 15:35:35,440 - INFO - 转换后的数据示例: {"00079235": {"title": "CA-A CANCER JOURNAL FOR CLINICIANS", "if": 503.1, "jcr_quartile": "Q1", "cas_quartile": "B1"}, "15424863": {"title": "CA-A CANCER JOURNAL FOR CLINICIANS", "if": 503.1, "jcr_quartile": "Q1", "cas_quartile": "B1"}, "14741776": {"title": "NATURE REVIEWS DRUG DISCOVERY", "if": 122.7, "jcr_quartile": "Q1", "cas_quartile": "B1"}}
2025-02-28 15:35:35,440 - INFO - 开始加载影响因子趋势数据: C:\Users\xxxxxxxxxx\nnscholar-search\nnscholar-search-main\data\journal_metrics\5year.json
2025-02-28 15:35:35,925 - INFO - 成功加载影响因子趋势数据,包含 21716 条记录
2025-02-28 15:35:35,925 - INFO - 首次运行,开始下载NLTK数据...
2025-02-28 15:35:37,032 - INFO - NLTK数据下载完成
2025-02-28 15:35:37,032 - INFO - 正在启动应用服务器...
2025-02-28 15:35:37,049 - WARNING - * Debugger is active!
2025-02-28 15:35:37,049 - INFO - * Debugger PIN: 911-814-275

@luckylykkk
Copy link
Owner

luckylykkk commented Feb 28, 2025 via email

@Roger-GOAT
Copy link
Author

Roger-GOAT commented Mar 1, 2025 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants