dfsql

IchHabeKeineNamen (Banyc)

documentation

Revision: the standalone count command has been replaced with len. Replace (count) with len, and col "count" with col "len".
- The unary count <col> command is unaffected.

cargo install dfsql

dfsql --input your.csv --output a-new.csv
# ...or
dfsql -i your.csv -o a-new.csv

exit/quit: leave the REPL.
```
exit
```
undo: undo the previous successful operation.
```
undo
```
reset: discard all changes and return to the original data frame.
```
reset
```
schema: show column names and types of the data frame.
```
schema
```
save: save the current data frame to a file.
```
save a-new.csv
```
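
To make the relationship between undo and reset concrete, here is a minimal Python sketch of one way such a session could be modelled. The `Session` class and its snapshot list are hypothetical and are not taken from dfsql's implementation; the sketch only assumes that each successful operation produces a new data frame.

```python
class Session:
    """Hypothetical model of the REPL state; not dfsql's actual code."""

    def __init__(self, original):
        self.original = original      # data frame as first loaded
        self.history = [original]     # one snapshot per successful operation

    @property
    def current(self):
        return self.history[-1]

    def apply(self, operation):
        # If the operation raises, nothing is appended, so a failed
        # command leaves the history untouched.
        self.history.append(operation(self.current))

    def undo(self):
        # Drop the latest snapshot, but never the original.
        if len(self.history) > 1:
            self.history.pop()

    def reset(self):
        self.history = [self.original]


session = Session([3, 1, 2])
session.apply(sorted)                   # first successful operation
session.apply(lambda rows: rows[:2])    # second successful operation
session.undo()
after_undo = session.current
session.reset()
after_reset = session.current
```

In this model, undo steps back one successful operation, while reset returns to the frame that was loaded at startup.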

filter

filter <expr>

filter first_name = "John"
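
As a rough illustration of what the filter above keeps, the following Python sketch applies the same predicate to a small hand-written table. The rows and the `filter_rows` helper are made up for this example and say nothing about how dfsql evaluates expressions internally.

```python
# Hypothetical sample data; each dict is one row.
rows = [
    {"first_name": "John", "last_name": "Adams"},
    {"first_name": "Abigail", "last_name": "Adams"},
    {"first_name": "John", "last_name": "Jay"},
]


def filter_rows(rows, predicate):
    """Keep only the rows for which the predicate is true."""
    return [row for row in rows if predicate(row)]


# Counterpart of: filter first_name = "John"
johns = filter_rows(rows, lambda row: row["first_name"] == "John")
```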

sort

sort ((asc | desc | ()) <col>)*

sort icpsr_id
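
The grammar allows several columns, each with an optional direction. The Python sketch below shows one way to read such a multi-key sort. It assumes that an omitted direction means ascending and that earlier columns take priority, neither of which is stated above; the `sort_rows` helper and the sample rows are hypothetical.

```python
def sort_rows(rows, keys):
    """keys: list of (direction, column) pairs, most significant first.

    direction is "asc", "desc", or "" (treated here as ascending).
    """
    result = list(rows)
    # A stable sort applied from the least significant key to the most
    # significant one yields the combined multi-key ordering.
    for direction, column in reversed(keys):
        result.sort(key=lambda row: row[column], reverse=(direction == "desc"))
    return result


rows = [
    {"state": "NY", "icpsr_id": 3},
    {"state": "CA", "icpsr_id": 1},
    {"state": "NY", "icpsr_id": 2},
]

# Rough counterpart of: sort asc state desc icpsr_id
ordered = sort_rows(rows, [("asc", "state"), ("desc", "icpsr_id")])
```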

join

(left | right | inner | full) join <var> on <col> <col>?

left join other on id ID
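
To show what a left join on differently named key columns produces, here is a small Python sketch. It assumes the first column names the key in the current data frame and the second names the key in the joined one, and that omitting the second column means both sides use the same name. The `left_join` helper and the sample tables are hypothetical.

```python
def left_join(left, right, left_col, right_col=None):
    """Keep every left row; attach matching right rows, or None when absent."""
    right_col = right_col or left_col
    # Index the right table by its key column.
    index = {}
    for row in right:
        index.setdefault(row[right_col], []).append(row)
    right_fields = [k for k in (right[0] if right else {}) if k != right_col]

    joined = []
    for row in left:
        matches = index.get(row[left_col])
        if matches:
            for match in matches:
                joined.append({**row, **{k: match[k] for k in right_fields}})
        else:
            # No match: fill the right-hand columns with nulls.
            joined.append({**row, **{k: None for k in right_fields}})
    return joined


people = [{"id": 1, "name": "Ann"}, {"id": 2, "name": "Bob"}]
other = [{"ID": 1, "city": "Oslo"}]

# Rough counterpart of: left join other on id ID
result = left_join(people, other, "id", "ID")
```

The unmatched row for Bob stays in the result with a null city, which is what distinguishes a left join from an inner join.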

col: reference to a column.

col : (<str> | <var>) -> <expr>

select col first_name

exclude: remove columns from the data frame.

exclude : <expr>* -> <expr>

select exclude last_name first_name
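
The effect of the exclude example can be pictured with a column-oriented Python sketch. The `exclude` function and the sample frame are hypothetical and only illustrate the result, not dfsql's internals.

```python
def exclude(frame, columns):
    """Return the frame without the named columns."""
    dropped = set(columns)
    return {name: values for name, values in frame.items() if name not in dropped}


# Hypothetical frame stored as column name -> list of values.
frame = {"first_name": ["John"], "last_name": ["Adams"], "age": [61]}

# Rough counterpart of: select exclude last_name first_name
remaining = exclude(frame, ["last_name", "first_name"])
```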

literal: literal values like 42, "John", 1.0, and null.
binary operations
```
select a * b
```
- Calculate the product of columns "a" and "b" and collect the result.
unary operations
```
select -a
```
```
select sum a
```
- Sum all values in column "a" and collect the scalar result.
alias: assign a name to a column.
```
alias : (<col> | <var>) <expr> -> <expr>
```
```
select alias product a * b
```
- Assign the name "product" to the product and collect the new column.
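
The difference between an element-wise binary operation, an aggregating unary operation, and an alias can be sketched in plain Python. The sample columns are made up for this illustration.

```python
# Hypothetical frame stored as column name -> list of values.
frame = {"a": [1, 2, 3], "b": [4, 5, 6]}

# select a * b: element-wise product, one value per row.
product = [x * y for x, y in zip(frame["a"], frame["b"])]

# select sum a: aggregation down to a single scalar.
total = sum(frame["a"])

# select alias product a * b: the same values under the name "product".
selected = {"product": product}
```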

conditional

<conditional> : if <expr> then <expr> (if <expr> then <expr>)* otherwise <expr> -> <expr>

select if class = 0 then "A" if class = 1 then "B" else null
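
A chained conditional can be read as "first matching branch wins, otherwise use the fallback". The Python sketch below models that reading; the `conditional` helper and the sample rows are hypothetical, and the first-match order is an assumption rather than something stated above.

```python
def conditional(branches, fallback):
    """branches: list of (predicate, value) pairs, checked in order."""
    def evaluate(row):
        for predicate, value in branches:
            if predicate(row):
                return value
        return fallback
    return evaluate


grade = conditional(
    [
        (lambda row: row["class"] == 0, "A"),
        (lambda row: row["class"] == 1, "B"),
    ],
    None,  # fallback value, the null in the example above
)

rows = [{"class": 0}, {"class": 1}, {"class": 2}]
grades = [grade(row) for row in rows]
```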
