This dataset contains the laws passed by the Iranian parliament (Majlis) up to April 29, 2024. The data has been crawled from the website rc.majlis.ir and contains essential information about each law including its title, date of approval, the approving body, the content of the law, and a reference URL.
- title (عنوان مصوبه): The title of the law.
- date (تاریخ تصویب): The date when the law was approved, converted to datetime format.
- reference (مرجع تصویب): The approving body.
- content (جزئیات متن قانون): The detailed content of the law, with stemming applied.
- url (لینک مصوبه): The URL linking to the original law on the rc.majlis.ir website.
- The `date` column has been converted to `datetime` format.
- The `title` and `content` columns have been stripped to facilitate text analysis tasks.
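The two preprocessing steps above can be sketched roughly as follows. This is only an illustration, not the script actually used to build the dataset: the sample rows are made up, and it assumes the raw dates are ISO-style strings and that "stripped" means removing leading and trailing whitespace.

```python
import pandas as pd

# Hypothetical raw rows standing in for crawled records (not real data).
raw = pd.DataFrame(
    {
        "title": ["  Sample law A  ", "Sample law B\n"],
        "date": ["2020-01-15", "not a date"],
        "content": [" Article 1 text ", "Article 2 text"],
    }
)

cleaned = raw.copy()

# Parse date strings; values that cannot be parsed become NaT instead of raising.
cleaned["date"] = pd.to_datetime(cleaned["date"], errors="coerce")

# Remove leading and trailing whitespace from the text columns.
for col in ["title", "content"]:
    cleaned[col] = cleaned[col].str.strip()

print(cleaned)
```

Using `errors="coerce"` keeps malformed dates visible as missing values, so they can be counted or inspected later rather than silently dropped.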
This dataset is suitable for a variety of tasks including, but not limited to:
- Text analysis and natural language processing (NLP) tasks.
- Legal research and comparative legal studies.
- Trend analysis in legislative activities.
- Machine learning projects focusing on text classification or clustering.
To help you get started, here are a few example tasks you could perform with this dataset:
- Text Classification: Classify laws based on their titles or content.
- Trend Analysis: Analyze trends in legislative activity over time.
- NLP Tasks: Apply various NLP techniques such as topic modeling, sentiment analysis, or named entity recognition.
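As a starting point for the trend analysis task, one possible approach is to count the number of laws approved per year. The sketch below uses a tiny made-up DataFrame in place of the real file and assumes the `date` column already holds datetime values.

```python
import pandas as pd

# Hypothetical rows standing in for the real dataset (not real data).
df = pd.DataFrame(
    {
        "title": ["Law A", "Law B", "Law C"],
        "date": pd.to_datetime(["2019-03-01", "2019-11-20", "2021-06-05"]),
    }
)

# Group rows by the year of approval and count how many laws fall in each year.
laws_per_year = df.groupby(df["date"].dt.year).size()

print(laws_per_year)
```

With the full dataset, the resulting series can be plotted directly, for example with `laws_per_year.plot(kind="bar")` if matplotlib is installed.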
To get started with this dataset, simply download the `law_cleaned.csv` file and load it into your preferred data analysis tool. For example, in Python, you can use pandas:
```python
import pandas as pd

# Load the dataset
df = pd.read_csv('law_cleaned.csv')

# Display the first few rows
print(df.head())
```
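Because CSV files store dates as plain text, the `date` column will come back as strings unless pandas is asked to parse it. The sketch below shows one way to do that and to check that the five documented columns are present. It reads a small made-up CSV from memory rather than the real file, and the sample values and `example.org` URLs are placeholders.

```python
import io

import pandas as pd

# Hypothetical two-row stand-in for law_cleaned.csv (not real data).
sample_csv = io.StringIO(
    "title,date,reference,content,url\n"
    "Law A,2019-03-01,Body X,Text A,https://example.org/a\n"
    "Law B,2021-06-05,Body Y,Text B,https://example.org/b\n"
)

# parse_dates asks pandas to convert the date column while reading.
df = pd.read_csv(sample_csv, parse_dates=["date"])

# Check that every column described in this README is present.
expected = ["title", "date", "reference", "content", "url"]
missing = [col for col in expected if col not in df.columns]

print("missing columns:", missing)
print(df.dtypes)
```

To apply this to the real file, replace `sample_csv` with the path to `law_cleaned.csv`.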
The data was collected by crawling the rc.majlis.ir website, which contains a comprehensive archive of Iranian laws. The dataset includes all available laws up to April 29, 2024.
You can also find this dataset on Kaggle.