Skip to content

Commit 60f673d

Browse files
committed
feat: Turkish language support has been added to README.md
1 parent 65add5f commit 60f673d

File tree

2 files changed

+235
-21
lines changed

2 files changed

+235
-21
lines changed

README.md

+27-21
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,8 @@
1-
21
# 🕷️ ScrapeGraphAI: You Only Scrape Once
2+
33
[English](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/README.md) | [中文](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/docs/chinese.md) | [日本語](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/docs/japanese.md)
44
| [한국어](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/docs/korean.md)
5-
| [Русский](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/docs/russian.md)
6-
5+
| [Русский](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/docs/russian.md) | [Türkçe](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/docs/turkish.md)
76

87
[![Downloads](https://img.shields.io/pepy/dt/scrapegraphai?style=for-the-badge)](https://pepy.tech/project/scrapegraphai)
98
[![linting: pylint](https://img.shields.io/badge/linting-pylint-yellowgreen?style=for-the-badge)](https://github.com/pylint-dev/pylint)
@@ -12,7 +11,7 @@
1211
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg?style=for-the-badge)](https://opensource.org/licenses/MIT)
1312
[![](https://dcbadge.vercel.app/api/server/gkxQDAjfeX)](https://discord.gg/gkxQDAjfeX)
1413

15-
ScrapeGraphAI is a *web scraping* python library that uses LLM and direct graph logic to create scraping pipelines for websites and local documents (XML, HTML, JSON, Markdown, etc.).
14+
ScrapeGraphAI is a _web scraping_ python library that uses LLM and direct graph logic to create scraping pipelines for websites and local documents (XML, HTML, JSON, Markdown, etc.).
1615

1716
Just say which information you want to extract and the library will do it for you!
1817

@@ -39,9 +38,11 @@ Additional dependecies can be added while installing the library:
3938
- <b>More Language Models</b>: additional language models are installed, such as Fireworks, Groq, Anthropic, Hugging Face, and Nvidia AI Endpoints.
4039

4140
This group allows you to use additional language models like Fireworks, Groq, Anthropic, Together AI, Hugging Face, and Nvidia AI Endpoints.
41+
4242
```bash
4343
pip install scrapegraphai[other-language-models]
4444
```
45+
4546
- <b>Semantic Options</b>: this group includes tools for advanced semantic processing, such as Graphviz.
4647

4748
```bash
@@ -56,13 +57,12 @@ Additional dependecies can be added while installing the library:
5657

5758
</details>
5859

59-
6060
## 💻 Usage
61+
6162
There are multiple standard scraping pipelines that can be used to extract information from a website (or local file).
6263

6364
The most common one is the `SmartScraperGraph`, which extracts information from a single page given a user prompt and a source URL.
6465

65-
6666
```python
6767
import json
6868
from scrapegraphai.graphs import SmartScraperGraph
@@ -98,16 +98,17 @@ The output will be a dictionary like the following:
9898
"contact_email": "contact@scrapegraphai.com"
9999
}
100100
```
101+
101102
There are other pipelines that can be used to extract information from multiple pages, generate Python scripts, or even generate audio files.
102103

103-
| Pipeline Name | Description |
104-
|-------------------------|------------------------------------------------------------------------------------------------------------------|
105-
| SmartScraperGraph | Single-page scraper that only needs a user prompt and an input source. |
106-
| SearchGraph | Multi-page scraper that extracts information from the top n search results of a search engine. |
107-
| SpeechGraph | Single-page scraper that extracts information from a website and generates an audio file. |
108-
| ScriptCreatorGraph | Single-page scraper that extracts information from a website and generates a Python script. |
109-
| SmartScraperMultiGraph | Multi-page scraper that extracts information from multiple pages given a single prompt and a list of sources. |
110-
| ScriptCreatorMultiGraph | Multi-page scraper that generates a Python script for extracting information from multiple pages and sources. |
104+
| Pipeline Name | Description |
105+
| ----------------------- | ------------------------------------------------------------------------------------------------------------- |
106+
| SmartScraperGraph | Single-page scraper that only needs a user prompt and an input source. |
107+
| SearchGraph | Multi-page scraper that extracts information from the top n search results of a search engine. |
108+
| SpeechGraph | Single-page scraper that extracts information from a website and generates an audio file. |
109+
| ScriptCreatorGraph | Single-page scraper that extracts information from a website and generates a Python script. |
110+
| SmartScraperMultiGraph | Multi-page scraper that extracts information from multiple pages given a single prompt and a list of sources. |
111+
| ScriptCreatorMultiGraph | Multi-page scraper that generates a Python script for extracting information from multiple pages and sources. |
111112

112113
For each of these graphs there is the multi version. It allows to make calls of the LLM in parallel.
113114

@@ -116,6 +117,7 @@ It is possible to use different LLM through APIs, such as **OpenAI**, **Groq**,
116117
Remember to have [Ollama](https://ollama.com/) installed and download the models using the **ollama pull** command, if you want to use local models.
117118

118119
## 🔍 Demo
120+
119121
Official streamlit demo:
120122

121123
[![My Skills](https://skillicons.dev/icons?i=react)](https://scrapegraph-ai-web-dashboard.streamlit.app)
@@ -131,6 +133,7 @@ The documentation for ScrapeGraphAI can be found [here](https://scrapegraph-ai.r
131133
Check out also the Docusaurus [here](https://scrapegraph-doc.onrender.com/).
132134

133135
## 🏆 Sponsors
136+
134137
<div style="text-align: center;">
135138
<a href="https://2ly.link/1zaXG">
136139
<img src="https://raw.githubusercontent.com/VinciGit00/Scrapegraph-ai/main/docs/assets/browserbase_logo.png" alt="Browserbase" style="width: 10%;">
@@ -156,15 +159,18 @@ Please see the [contributing guidelines](https://github.com/VinciGit00/Scrapegra
156159
[![My Skills](https://skillicons.dev/icons?i=linkedin)](https://www.linkedin.com/company/scrapegraphai/)
157160
[![My Skills](https://skillicons.dev/icons?i=twitter)](https://twitter.com/scrapegraphai)
158161

159-
## 📈 Telemetry
160-
We collect anonymous usage metrics to enhance our package's quality and user experience. The data helps us prioritize improvements and ensure compatibility. If you wish to opt-out, set the environment variable SCRAPEGRAPHAI_TELEMETRY_ENABLED=false. For more information, please refer to the documentation [here](https://scrapegraph-ai.readthedocs.io/en/latest/scrapers/telemetry.html).
162+
## 📈 Telemetry
161163

164+
We collect anonymous usage metrics to enhance our package's quality and user experience. The data helps us prioritize improvements and ensure compatibility. If you wish to opt-out, set the environment variable SCRAPEGRAPHAI_TELEMETRY_ENABLED=false. For more information, please refer to the documentation [here](https://scrapegraph-ai.readthedocs.io/en/latest/scrapers/telemetry.html).
162165

163166
## ❤️ Contributors
167+
164168
[![Contributors](https://contrib.rocks/image?repo=VinciGit00/Scrapegraph-ai)](https://github.com/VinciGit00/Scrapegraph-ai/graphs/contributors)
165169

166170
## 🎓 Citations
171+
167172
If you have used our library for research purposes please quote us with the following reference:
173+
168174
```text
169175
@misc{scrapegraph-ai,
170176
author = {Marco Perini, Lorenzo Padoan, Marco Vinciguerra},
@@ -181,11 +187,11 @@ If you have used our library for research purposes please quote us with the foll
181187
<img src="https://raw.githubusercontent.com/VinciGit00/Scrapegraph-ai/main/docs/assets/logo_authors.png" alt="Authors_logos">
182188
</p>
183189

184-
| | Contact Info |
185-
|--------------------|----------------------|
186-
| Marco Vinciguerra | [![Linkedin Badge](https://img.shields.io/badge/-Linkedin-blue?style=flat&logo=Linkedin&logoColor=white)](https://www.linkedin.com/in/marco-vinciguerra-7ba365242/) |
187-
| Marco Perini | [![Linkedin Badge](https://img.shields.io/badge/-Linkedin-blue?style=flat&logo=Linkedin&logoColor=white)](https://www.linkedin.com/in/perinim/) |
188-
| Lorenzo Padoan | [![Linkedin Badge](https://img.shields.io/badge/-Linkedin-blue?style=flat&logo=Linkedin&logoColor=white)](https://www.linkedin.com/in/lorenzo-padoan-4521a2154/) |
190+
| | Contact Info |
191+
| ----------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
192+
| Marco Vinciguerra | [![Linkedin Badge](https://img.shields.io/badge/-Linkedin-blue?style=flat&logo=Linkedin&logoColor=white)](https://www.linkedin.com/in/marco-vinciguerra-7ba365242/) |
193+
| Marco Perini | [![Linkedin Badge](https://img.shields.io/badge/-Linkedin-blue?style=flat&logo=Linkedin&logoColor=white)](https://www.linkedin.com/in/perinim/) |
194+
| Lorenzo Padoan | [![Linkedin Badge](https://img.shields.io/badge/-Linkedin-blue?style=flat&logo=Linkedin&logoColor=white)](https://www.linkedin.com/in/lorenzo-padoan-4521a2154/) |
189195

190196
## 📜 License
191197

docs/turkish.md

+208
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,208 @@
1+
# 🕷️ ScrapeGraphAI: You Only Scrape Once
2+
3+
[English](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/README.md) | [中文](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/docs/chinese.md) | [日本語](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/docs/japanese.md)
4+
| [한국어](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/docs/korean.md)
5+
| [Русский](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/docs/russian.md) | [Turkish](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/docs/turkish.md)
6+
7+
[![Downloads](https://img.shields.io/pepy/dt/scrapegraphai?style=for-the-badge)](https://pepy.tech/project/scrapegraphai)
8+
[![linting: pylint](https://img.shields.io/badge/linting-pylint-yellowgreen?style=for-the-badge)](https://github.com/pylint-dev/pylint)
9+
[![Pylint](https://img.shields.io/github/actions/workflow/status/VinciGit00/Scrapegraph-ai/pylint.yml?label=Pylint&logo=github&style=for-the-badge)](https://github.com/VinciGit00/Scrapegraph-ai/actions/workflows/pylint.yml)
10+
[![CodeQL](https://img.shields.io/github/actions/workflow/status/VinciGit00/Scrapegraph-ai/codeql.yml?label=CodeQL&logo=github&style=for-the-badge)](https://github.com/VinciGit00/Scrapegraph-ai/actions/workflows/codeql.yml)
11+
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg?style=for-the-badge)](https://opensource.org/licenses/MIT)
12+
[![](https://dcbadge.vercel.app/api/server/gkxQDAjfeX)](https://discord.gg/gkxQDAjfeX)
13+
14+
ScrapeGraphAI, web siteleri ve yerel belgeler (XML, HTML, JSON, Markdown vb.) için kazıma hatları oluşturmak üzere LLM ve doğrudan grafik mantığını kullanan bir web scraping Python kütüphanesidir.
15+
16+
Sadece çıkarmak istediğiniz bilgiyi belirtin; kütüphane bunu sizin için gerçekleştirecektir!
17+
18+
<p align="center">
19+
<img src="https://raw.githubusercontent.com/VinciGit00/Scrapegraph-ai/main/docs/assets/sgai-hero.png" alt="ScrapeGraphAI Hero" style="width: 100%;">
20+
</p>
21+
22+
## 🚀 Hızlı kurulum
23+
24+
ScrapeGraphAI için referans sayfası, PyPI'nin resmi sayfasında mevcuttur: [pypi](https://pypi.org/project/scrapegraphai/).
25+
26+
```bash
27+
pip install scrapegraphai
28+
29+
playwright install
30+
```
31+
32+
**NOT**: Diğer kütüphanelerle çakışmaları önlemek için kütüphaneyi bir sanal ortamda kurmanız önerilir.
33+
34+
<details>
35+
<summary><b>İsteğe Bağlı Bağımlılıklar</b></summary>
36+
37+
Kütüphane kurulumunda ek bağımlılıklar eklenebilir:
38+
39+
- <b>Daha Fazla Dil Modeli</b>: Fireworks, Groq, Anthropic, Hugging Face ve Nvidia AI Endpoints gibi ek dil modelleri yüklenir.
40+
41+
Bu grup, Fireworks, Groq, Anthropic, Together AI, Hugging Face ve Nvidia AI Endpoints gibi ek dil modellerini kullanmanıza olanak tanır.
42+
43+
```bash
44+
pip install scrapegraphai[other-language-models]
45+
```
46+
47+
- <b>Anlamsal Seçenekler</b>: Bu grup, Graphviz gibi ileri düzey anlamsal işleme araçlarını içerir.
48+
49+
```bash
50+
pip install scrapegraphai[more-semantic-options]
51+
```
52+
53+
- <b>Tarayıcı Seçenekleri</b>: Bu grup, Browserbase gibi ek tarayıcı yönetim araçlarını/hizmetlerini içerir.
54+
55+
```bash
56+
pip install scrapegraphai[more-browser-options]
57+
```
58+
59+
</details>
60+
61+
## 💻 Kullanım
62+
63+
Bir web sitesinden (veya yerel dosyadan) bilgi almak için kullanılabilecek birçok standart kazıma hattı vardır.
64+
65+
En yaygın olanı, bir kullanıcı istemi ve bir kaynak URL'si verildiğinde tek bir sayfadan bilgi çıkaran `SmartScraperGraph`'tır.
66+
67+
```python
68+
import json
69+
from scrapegraphai.graphs import SmartScraperGraph
70+
71+
# Kazıma hattı için yapılandırmayı tanımlayın
72+
73+
graph_config = {
74+
"llm": {
75+
"api_key": "YOUR_OPENAI_APIKEY",
76+
"model": "openai/gpt-4o-mini",
77+
},
78+
"verbose": True,
79+
"headless": False,
80+
}
81+
82+
# SmartScraperGraph örneğini oluşturun
83+
84+
smart_scraper_graph = SmartScraperGraph(
85+
prompt="Şirketin ne yaptığı, adı ve iletişim e-postası hakkında bazı bilgiler bulun.",
86+
source="https://scrapegraphai.com/",
87+
config=graph_config
88+
)
89+
90+
# Hattı çalıştırın
91+
92+
result = smart_scraper_graph.run()
93+
print(json.dumps(result, indent=4))
94+
95+
```
96+
97+
Çıktı, aşağıdaki gibi bir sözlük olacaktır:
98+
99+
```python
100+
{
101+
"company": "ScrapeGraphAI",
102+
"name": "ScrapeGraphAI Extracting content from websites and local documents using LLM",
103+
"contact_email": "contact@scrapegraphai.com"
104+
}
105+
```
106+
107+
Birden fazla sayfadan bilgi ayıklamak, Python komut dosyaları oluşturmak ve hatta ses dosyaları oluşturmak için kullanılabilecek başka işlem hatları da vardır.
108+
109+
| Pipeline Name | Description |
110+
| ----------------------- | ------------------------------------------------------------------------------------------------------------- |
111+
| SmartScraperGraph | Single-page scraper that only needs a user prompt and an input source. |
112+
| SearchGraph | Multi-page scraper that extracts information from the top n search results of a search engine. |
113+
| SpeechGraph | Single-page scraper that extracts information from a website and generates an audio file. |
114+
| ScriptCreatorGraph | Single-page scraper that extracts information from a website and generates a Python script. |
115+
| SmartScraperMultiGraph | Multi-page scraper that extracts information from multiple pages given a single prompt and a list of sources. |
116+
| ScriptCreatorMultiGraph | Multi-page scraper that generates a Python script for extracting information from multiple pages and sources. |
117+
118+
Bu grafiklerin her biri için çoklu versiyonu vardır. Bu, LLM'yi paralel olarak çağırmayı sağlar.
119+
120+
Farklı LLM'leri API'ler aracılığıyla kullanmak mümkündür, örneğin **OpenAI**, **Groq**, **Azure** ve **Gemini**, veya **Ollama** kullanarak yerel modeller.
121+
122+
Yerel modelleri kullanmak istiyorsanız, [Ollama](https://ollama.com/) kurulu olduğundan emin olun ve modelleri indirmek için **ollama pull** komutunu kullanın.
123+
124+
## 🔍 Demo
125+
126+
Resmi Streamlit demosu:
127+
128+
[![My Skills](https://skillicons.dev/icons?i=react)](https://scrapegraph-ai-web-dashboard.streamlit.app)
129+
130+
Bunu doğrudan web üzerinde Google Colab kullanarak deneyin:
131+
132+
[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1sEZBonBMGP44CtO6GQTwAlL0BGJXjtfd?usp=sharing)
133+
134+
## 📖 Dokümantasyon
135+
136+
ScrapeGraphAI için dokümantasyonu [buradan](https://scrapegraph-ai.readthedocs.io/en/latest/) bulabilirsiniz.
137+
138+
Docusaurus'u da [buradan](https://scrapegraph-doc.onrender.com/) kontrol edin.
139+
140+
## 🏆 Sponsorlar
141+
142+
<div style="text-align: center;">
143+
<a href="https://2ly.link/1zaXG">
144+
<img src="https://raw.githubusercontent.com/VinciGit00/Scrapegraph-ai/main/docs/assets/browserbase_logo.png" alt="Browserbase" style="width: 10%;">
145+
</a>
146+
<a href="https://2ly.link/1zNiz">
147+
<img src="https://raw.githubusercontent.com/VinciGit00/Scrapegraph-ai/main/docs/assets/serp_api_logo.png" alt="SerpAPI" style="width: 10%;">
148+
</a>
149+
<a href="https://2ly.link/1zNj1">
150+
<img src="https://raw.githubusercontent.com/VinciGit00/Scrapegraph-ai/main/docs/assets/transparent_stat.png" alt="Stats" style="width: 15%;">
151+
</a>
152+
<a href="https://scrape.do">
153+
<img src="https://raw.githubusercontent.com/VinciGit00/Scrapegraph-ai/main/docs/assets/scrapedo.png" alt="Stats" style="width: 11%;">
154+
</a>
155+
</div>
156+
157+
## 🤝 Katkıda Bulunma
158+
159+
Katkıda bulunmaktan çekinmeyin ve iyileştirmeleri tartışmak ve önerilerinizi iletmek için Discord sunucumuza katılın!
160+
161+
Lütfen [katkı sağlama yönergelerini](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/CONTRIBUTING.md) inceleyin.
162+
163+
[![My Skills](https://skillicons.dev/icons?i=discord)](https://discord.gg/uJN7TYcpNa)
164+
[![My Skills](https://skillicons.dev/icons?i=linkedin)](https://www.linkedin.com/company/scrapegraphai/)
165+
[![My Skills](https://skillicons.dev/icons?i=twitter)](https://twitter.com/scrapegraphai)
166+
167+
## 📈 Telemetri
168+
169+
Paketin kalitesini ve kullanıcı deneyimini geliştirmek için anonim kullanım istatistikleri topluyoruz. Bu veriler, iyileştirmeleri önceliklendirmemize ve uyumluluğu sağlamamıza yardımcı olur. Eğer bu verileri almak istemiyorsanız, ortam değişkenini SCRAPEGRAPHAI_TELEMETRY_ENABLED=false olarak ayarlayın. Daha fazla bilgi için lütfen dokümantasyona [buradan](https://scrapegraph-ai.readthedocs.io/en/latest/scrapers/telemetry.html) bakın.
170+
171+
## ❤️ Katkıda Bulunanlar
172+
173+
[![Contributors](https://contrib.rocks/image?repo=VinciGit00/Scrapegraph-ai)](https://github.com/VinciGit00/Scrapegraph-ai/graphs/contributors)
174+
175+
## 🎓 Atıflar
176+
177+
Eğer kütüphanemizi araştırma amaçlı kullandıysanız, lütfen aşağıdaki referansla atıfta bulunun:
178+
179+
```text
180+
@misc{scrapegraph-ai,
181+
author = {Marco Perini, Lorenzo Padoan, Marco Vinciguerra},
182+
title = {Scrapegraph-ai},
183+
year = {2024},
184+
url = {https://github.com/VinciGit00/Scrapegraph-ai},
185+
note = {A Python library for scraping leveraging large language models}
186+
}
187+
```
188+
189+
## Yazarlar
190+
191+
<p align="center">
192+
<img src="https://raw.githubusercontent.com/VinciGit00/Scrapegraph-ai/main/docs/assets/logo_authors.png" alt="Authors_logos">
193+
</p>
194+
195+
| | İletişim Bilgisi |
196+
| ----------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
197+
| Marco Vinciguerra | [![Linkedin Badge](https://img.shields.io/badge/-Linkedin-blue?style=flat&logo=Linkedin&logoColor=white)](https://www.linkedin.com/in/marco-vinciguerra-7ba365242/) |
198+
| Marco Perini | [![Linkedin Badge](https://img.shields.io/badge/-Linkedin-blue?style=flat&logo=Linkedin&logoColor=white)](https://www.linkedin.com/in/perinim/) |
199+
| Lorenzo Padoan | [![Linkedin Badge](https://img.shields.io/badge/-Linkedin-blue?style=flat&logo=Linkedin&logoColor=white)](https://www.linkedin.com/in/lorenzo-padoan-4521a2154/) |
200+
201+
## 📜 Lisans
202+
203+
ScrapeGraphAI, MIT Lisansı altında lisanslanmıştır. Daha fazla bilgi için [LİSANS](https://github.com/VinciGit00/Scrapegraph-ai/blob/main/LICENSE) dosyasına bakın.
204+
205+
## Teşekkürler
206+
207+
- Projeye katkıda bulunan tüm katkı sahiplerine ve açık kaynak topluluğuna destekleri için teşekkür etmek isteriz.
208+
- ScrapeGraphAI, yalnızca veri keşfi ve araştırma amaçları için kullanılmak üzere tasarlanmıştır. Kütüphanenin herhangi bir kötüye kullanımından sorumlu değiliz.

0 commit comments

Comments
 (0)