The 87th PHP Study Group (PHP勉強会) LT: Text-Processing Commands That Might Be Worth Knowing

Text-Processing Commands That Might Be Worth Knowing @serima

Transcript of The 87th PHP Study Group (PHP勉強会) LT: Text-Processing Commands That Might Be Worth Knowing

  • @serima

  • | PHP

    8

    PHP

  • mysqldump

    scp

  • TreasureData

    TreasureData

  • Excel

  • PHP

  • UNIX

  • cat, sort, uniq, wc, grep, awk, diff, cut, sed

  • join

    key

    sort

  • join

    a.txt ID, ,

    b.txt ID,

    join -a1 -e- -o0 1.2 1.3 2.2
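The join options on this slide can be tried on small sample files. The following is a minimal sketch, not the speaker's original data: the column meanings (ID, name, age in a.txt; ID, city in b.txt) and the variable names are assumptions made for illustration, and the `-o` list is written in its comma-separated form.

```shell
# Work in a throwaway directory so no real files are touched.
workdir=$(mktemp -d)

# Hypothetical sample data; both files are already sorted on the ID column.
printf '%s\n' '1 alice 30' '2 bob 25' '3 carol 41' > "$workdir/a.txt"
printf '%s\n' '1 tokyo' '3 osaka' > "$workdir/b.txt"

# -a1 : also print lines of a.txt that have no partner in b.txt
# -e- : fill fields that are missing with "-"
# -o  : output the join key (0), a.txt fields 2 and 3, then b.txt field 2
join_result=$(join -a1 -e- -o 0,1.2,1.3,2.2 "$workdir/a.txt" "$workdir/b.txt")
printf '%s\n' "$join_result"

rm -rf "$workdir"
```

With this sample data the ID 2 has no row in b.txt, so its last column is printed as `-` while IDs 1 and 3 get their city appended.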

  • |

  • |

    cat a.txt | sort > a_sorted.txt

  • A B ID

    1 2 3

    1 3

    2

    A B

  • diff awk

    diff
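The exact command on this slide did not survive, so the following is only a sketch of one common way to get "IDs in A that are not in B" with awk. The file contents (A holding 1, 2, 3 and B holding 1, 3) follow the values that remain on the earlier slide; the variable names are hypothetical.

```shell
workdir=$(mktemp -d)

# One ID per line, as on the slide: A has 1 2 3, B has 1 3.
printf '%s\n' 1 2 3 > "$workdir/A"
printf '%s\n' 1 3 > "$workdir/B"

# While reading the first file (NR==FNR is true only there), remember
# every line of B. While reading the second file, print the lines of A
# that were never seen in B. This idiom assumes B is not empty.
only_in_a=$(awk 'NR==FNR { seen[$0]; next } !($0 in seen)' "$workdir/B" "$workdir/A")
printf '%s\n' "$only_in_a"

rm -rf "$workdir"
```

Unlike diff, this approach does not need the two files to be in the same order, at the cost of holding B in memory.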

  • Union of A and B

    cat A B | sort | uniq

    Intersection of A and B

    cat A B | sort | uniq -d

    join A B
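The three pipelines above can be checked against each other on sample data. This is a small sketch with hypothetical file contents; it assumes one ID per line and no repeated ID within a single file, which is what makes `uniq -d` behave as an intersection.

```shell
workdir=$(mktemp -d)

# Hypothetical sample data, already sorted, one ID per line.
printf '%s\n' 1 2 3 > "$workdir/A"
printf '%s\n' 1 3 4 > "$workdir/B"

# Every ID that appears in A or in B, listed once.
union=$(cat "$workdir/A" "$workdir/B" | sort | uniq)

# uniq -d keeps only lines that occur more than once after sorting,
# which here means "present in both files".
intersection=$(cat "$workdir/A" "$workdir/B" | sort | uniq -d)

# join on sorted input pairs up the lines whose key is in both files.
joined=$(join "$workdir/A" "$workdir/B")

printf 'union: %s\n' "$(printf '%s' "$union" | tr '\n' ' ')"
printf 'intersection: %s\n' "$(printf '%s' "$intersection" | tr '\n' ' ')"

rm -rf "$workdir"
```

Here `uniq -d` and `join` return the same IDs (1 and 3). If a file could contain the same ID twice, `uniq -d` would report it even when the other file lacks it, so join is the safer choice in that case.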

  • join

    join

  • Querying TSV / CSV files

    q - Text as Data

    https://github.com/harelba/q

    q

  • % seq 1 1000 | q "select avg(c1),sum(c1) from -"
    500.5 500500

    c1: column 1; avg(c1): average of 1 to 1000; sum(c1): sum of 1 to 1000

  • GROUP BY

    % cat test.txt
    Mike 10
    John 1
    Bob 8
    Mary 19
    Bob 15
    Bob 20
    Mike 2

    % q "select c1, count(*), sum(c2) from test.txt group by c1"
    Bob 3 43
    John 1 1
    Mary 1 19
    Mike 2 12
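For comparison, the same grouping can be done without q. This is a swapped-in technique using awk, not something shown in the talk: a sketch that reuses the test.txt rows from the slide, with hypothetical variable names.

```shell
workdir=$(mktemp -d)

# The same rows as test.txt on the slide.
printf '%s\n' 'Mike 10' 'John 1' 'Bob 8' 'Mary 19' 'Bob 15' 'Bob 20' 'Mike 2' \
  > "$workdir/test.txt"

# Keep a per-name row count and running total in awk arrays, then print
# one line per name. awk does not guarantee any order when iterating
# over an array, so the result is piped through sort.
grouped=$(awk '{ count[$1]++; sum[$1] += $2 }
               END { for (name in count) print name, count[name], sum[name] }' \
              "$workdir/test.txt" | sort)
printf '%s\n' "$grouped"

rm -rf "$workdir"
```

The output matches the q result above (Bob 3 43, John 1 1, Mary 1 19, Mike 2 12). The SQL form is arguably easier to read once the query grows beyond a single aggregate.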

  • JOIN

    % cat test1.txt
    a1234 aa1
    b1234 aa2
    c1234 aa3
    d1234 aa3

    % cat test2.txt
    1 a1234 bb1
    2 a1234 bb2
    3 b1234 cc1
    4 c1234 dd3
    5 e1234 NULL

    % q "select * from test1.txt as a left join test2.txt as b on a.c1 = b.c2"
    a1234 aa1 1 a1234 bb1
    a1234 aa1 2 a1234 bb2
    b1234 aa2 3 b1234 cc1
    c1234 aa3 4 c1234 dd3
    d1234 aa3
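The LEFT JOIN above can also be approximated with the join command from earlier in the talk. This is a sketch for comparison only, not part of the original slides; it relies on both files already being sorted on their join column, and the variable names are hypothetical.

```shell
workdir=$(mktemp -d)

# The same rows as test1.txt and test2.txt on the slide.
printf '%s\n' 'a1234 aa1' 'b1234 aa2' 'c1234 aa3' 'd1234 aa3' \
  > "$workdir/test1.txt"
printf '%s\n' '1 a1234 bb1' '2 a1234 bb2' '3 b1234 cc1' '4 c1234 dd3' '5 e1234 NULL' \
  > "$workdir/test2.txt"

# -1 1 : join on field 1 of test1.txt
# -2 2 : join on field 2 of test2.txt
# -a1  : keep test1.txt rows without a match (the "left" side of the join)
left_join=$(join -a1 -1 1 -2 2 "$workdir/test1.txt" "$workdir/test2.txt")
printf '%s\n' "$left_join"

rm -rf "$workdir"
```

The rows correspond to the q output, but join prints the key only once per line (for example `a1234 aa1 1 bb1`), whereas `select *` repeats it from both tables. The unmatched row `d1234 aa3` is kept in both cases.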

  • http://serima.co/blog