Changes between Version 1 and Version 2 of cs/ZpracovaniMetainformaci
- Timestamp:
- Jul 1, 2014, 2:00:36 PM (10 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
cs/ZpracovaniMetainformaci
v1 v2 27 27 $ cat /nlp/corpora/vertical/INFO_O_DUPLICITACH/vertikal.list \ 28 28 | egrep ':eco:eco$' \ #vybereme radky odpovidajici dokumentum z ekonomiky 29 | cut -d: -f4-6 \ 30 | list2vert 29 | cut -d: -f4-6 \ #vybereme udaje pro program list2vert 30 | list2vert #ze seznamu dokumentu vytvorime vertikalni text 31 31 }}} 32 32 == Metainformace jako SQL databáze == #Metainformace_jako_SQL_datab.2BAOE-ze … … 44 44 4 sloupce 45 45 mysql> explain vertical_M; 46 +--------+--------------------- +------+-----+---------+-------+47 | Field | Type 48 +--------+--------------------- +------+-----+---------+-------+49 | Doc | varchar(255) binary | | MUL | ||50 | M_Auth | varchar(255) | YES | | NULL ||51 | M_Date | date | YES | | NULL ||52 | M_Desc | varchar(255) | YES | | NULL ||46 +--------+----------------------+------+-----+---------+-------+ 47 | Field | Type | Null | Key | Default | Extra | 48 +--------+----------------------+------+-----+---------+-------+ 49 | Doc | varchar(255) binary | | MUL | | | 50 | M_Auth | varchar(255) | YES | | NULL | | 51 | M_Date | date | YES | | NULL | | 52 | M_Desc | varchar(255) | YES | | NULL | | 53 53 +--------+---------------------+------+-----+---------+-------+ 54 54 }}} … … 145 145 $ head -2 /nlp/corpora/vertical/SQL_DATA/vertical 146 146 Y Y \N 1998-01-01 \N S/J/1998/150/001 150 hori 1998-01-01 \N cs 99 88 j 101 5373 \N 6F59FE5D 695 cnk_nove \N sec Y \N \N cnk/150hori/150.vert 147 Y Y \N 1998-01-01 \N S/J/1998/150/003 150 hori 1998-01-01 \N cs 94 69 j 7858 147 Y Y \N 1998-01-01 \N S/J/1998/150/003 150 hori 1998-01-01 \N cs 94 69 j 7858 4463 \N 5E84F5FB 662 cnk_nove \N sec Y \N \N cnk/150hori/150.vert 148 148 }}} 149 149 Převod z metainformací do tohoto formátu provede program meta2sql, který jako svůj vstup očekává jméno souboru a vytvoří dva soubory s koncovkami .sql1 a .sql2. Takže použití: … … 175 175 | CONCAT(Position_offset,',',Position_length,':',Doc,':',Vertikal) | 176 176 +------------------------------------------------------------------+ 177 | 5231183,14467:S/NWS/1995/hnb5/1994:cnk/HN/hnb5.vert 178 | 4645,15025:S/NWS/1999/ln99086e/003:cnk/LN/1999/ln99086e.vert 177 | 5231183,14467:S/NWS/1995/hnb5/1994:cnk/HN/hnb5.vert | 178 | 4645,15025:S/NWS/1999/ln99086e/003:cnk/LN/1999/ln99086e.vert | 179 179 +------------------------------------------------------------------+ 180 180 2 rows in set (0.07 sec)