Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
10 changes: 5 additions & 5 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

In this package, there is only one function

```
```julia
scrape_tables(url)
```

Expand All @@ -12,7 +12,7 @@ By default the function uses `Cascadia.nodeText` to extract the text from each `

However, if you wish to extract more than the text node you may want to use

```
```julia
scrape_tables(url, identity)
```

Expand All @@ -22,7 +22,7 @@ Also, you can put any callable into the `cell_transform` argument to do custom t

E.g.

```
```julia
scrape_tables(url, cell_transform)
```

Expand All @@ -34,7 +34,7 @@ scrape_tables(url, cell_transform)

The returned table is TableScraper.Table which is defined as below

```
```julia
struct Table
rows
columnnames
Expand All @@ -43,7 +43,7 @@ end

So if you need to scrape some malformed tables, you can directly manipulate the data as in the below example

```
```julia
url = "https://www.ssa.gov/oact/NOTES/as120/images/LD_fig5.html"
tbl = only(TableScraper.scrape_tables(url, strip ∘ nodeText))

Expand Down