1*008b33e8Schristos 2*008b33e8Schristos#------------------------------------------------------------------------------ 3*008b33e8Schristos# $File: apache,v 1.1 2017/04/11 14:52:15 christos Exp $ 4*008b33e8Schristos# apache: file(1) magic for Apache Big Data formats 5*008b33e8Schristos 6*008b33e8Schristos# Avro files 7*008b33e8Schristos0 string Obj Apache Avro 8*008b33e8Schristos>3 byte x version %d 9*008b33e8Schristos 10*008b33e8Schristos# ORC files 11*008b33e8Schristos# Important information is in file footer, which we can't index to :( 12*008b33e8Schristos0 string ORC Apache ORC 13*008b33e8Schristos 14*008b33e8Schristos# Parquet files 15*008b33e8Schristos0 string PAR1 Apache Parquet 16*008b33e8Schristos 17*008b33e8Schristos# Hive RC files 18*008b33e8Schristos0 string RCF Apache Hive RC file 19*008b33e8Schristos>3 byte x version %d 20*008b33e8Schristos 21*008b33e8Schristos# Sequence files (and the careless first version of RC file) 22*008b33e8Schristos 23*008b33e8Schristos0 string SEQ 24*008b33e8Schristos>3 byte <6 Apache Hadoop Sequence file version %d 25*008b33e8Schristos>3 byte >6 Apache Hadoop Sequence file version %d 26*008b33e8Schristos>3 byte =6 27*008b33e8Schristos>>5 string org.apache.hadoop.hive.ql.io.RCFile$KeyBuffer Apache Hive RC file version 0 28*008b33e8Schristos>>3 default x Apache Hadoop Sequence file version 6 29