[Sqoop-User] Sqoop modifies the DATE format in the exported data

Hi All,
I am exporting a table from Oracle using Sqoop. I have a date column in
Oracle table with  format as DD-MON-YY. I get the same format when i dump
the data from Oracle SQL Developer. But, when i dump the data using Sqoop i
get the following format YYYY-MM-DD HH:MM:SS.x.
For the value "24-JAN-08" in DB, Sqoop will dump it as  "2008-01-24
00:00:00.0". Is this an expected behavior? If yes, please let me know why
does sqoop adds the unnecessary timestamp at the end and also modifies the
original date format?
-- 
Thanks & Regards,
Anil Gupta

Reply To : Sqoop Modifies The DATE Format In The Exported Data

asked Mar 12 2012 at 20:57

anil gupta

5 Replies for : Sqoop Modifies The DATE Format In The Exported Data

Hi Anil,
Some of the Oracle JDBC drivers (version < 9.2 && > 11.1) auto-converts
date to timestamp:
http://www.oracle.com/technetwork/database/enterprise-edition/jdbc-faq-090281.html#08_01
Since Sqoop uses the JDBC driver to import data from the Oracle db, dates
in output files are in the form of timestamp.
Thanks,
Cheolsoo

Reply To : Sqoop Modifies The DATE Format In The Exported Data

answered Mar 12 2012 at 21:08

Cheolsoo Park

Hi Cheolsoo,
Thanks for the inputs. AFAIK, SQL Developer also uses JDBC but its dumping
the data in the same format as its in DB. So, i am wondering why Sqoop is
unable to dump the data similar to SQL Developer? I am using SQL Developer
3.1.07.
Thanks,
Anil Gupta

Reply To : Sqoop Modifies The DATE Format In The Exported Data

answered Mar 12 2012 at 21:17

anil gupta

>From the link you had sent me in previous mail. Here is something which
might stop the date being converted into Timestamp:
"If for some reason your app is very sensitive to this change and you
simply must have the 9i-10g behavior, there is a connection property you
can set. Set mapDateToTimestamp to false and the driver will revert to the
default 9i-10g behavior and map DATE to Date. "  Will this help in
resolving this problem?
Thanks,
Anil

Reply To : Sqoop Modifies The DATE Format In The Exported Data

answered Mar 12 2012 at 21:26

anil gupta

Hi Anil,
This issue is already being tracked by the following Apache JIRAs:
https://issues.apache.org/jira/browse/SQOOP-451
https://issues.apache.org/jira/browse/SQOOP-456
In fact, I am currently working on adding new options via which the user
can specify format masks for date, time, and timestamp.
Thanks,
Cheolsoo

Reply To : Sqoop Modifies The DATE Format In The Exported Data

answered Mar 12 2012 at 21:31

Cheolsoo Park

Thanks Cheolsoo, SQOOP-456 will address my problem. In the meantime, i will
be moving ahead by truncating the time part from the export and then
processing the data because in my use-case the time is stored in another
column.
Thanks,
Anil

Reply To : Sqoop Modifies The DATE Format In The Exported Data

answered Mar 12 2012 at 21:54

anil gupta

Related discussions

How Does One Preprocess The Data So That They Can Be Exported Using Sqoop

Hi I would be grateful for any tips on how to "prepare" the data so they can be exported to a Postgesql Database using sqoop. As an example: Provided some files of events. (user events, product events, productActivity events) [file0001] event:user propertes:{name:"john" ...} event:product properties:{ref:123,color:"blue",... event:productActivity properties:{user:"john", product:"ref", action:"

Special Character In Sqoop Exported Data

I trying to import HDFS data to Mysql using the following command: /opt/cloudera/parcels/CDH-5.0.0-1.cdh5.0.0.p0.47/bin/sqoop export --driver com.mysql.jdbc.Driver --connect jdbc:mysql://server:port/dbname --table table_name --username user --password pwd --export-dir /user/hdfs/hiveoutput/consolidatedconsumption/Hits/* --input-fields-terminated-by '\t' --input-lines-terminated-by '\n'

What Is The Data Format Of Kafka's Data Nodes In ZooKeeper?

I'm trying to read data from ZooKeeper nodes that was written by different Kafka components. As a specific example (just one from a bunch), I'm trying to read current offset for specific group, topic and partition. As far as I understand, it is stored under the path /consumers/data-processing-team/offsets/unloads/35 I'm using `com.101tec.zkclient` to get data. I'm able to walk through

Impalad Crashed When Query Data Stored In The RCFILE Format Stored In Hive Table

Hi, I have installed impala 0.6 and CDH 4.2, i have setuped my cluster with three data nodes and a namenode. First, i created a table data stored as TEXTFILE format in hive, And i have loaded about 150 millons rows into the table, I could query data in hive and in impalad-shell without any errors, But it was too slow query speed(described on https://groups.google.com/a/cloudera.org/forum/#!topic

Load Data Infile Date Format Issue

I have a text file which is extracted from a non-sql database each night and then a cron sql script runs to insert the text data into the mysql database tables. My problem is that the date data in the text file is formatted incosistently (12/31/00 or 12-31-00) and so the fields that hold date data are currently char datatypes. Since I need the dates to be dates for queries, I need a solution

MongoDb Date Format

I was doing bulk insert into MongoDB using NodeJs (native -driver). I have date field in the data. Is there anyway to store the date field as Date rather than String? I have date in dd/mm/yyyy format. In current scenario I attain the result by iterating through the bulk data converting the date into mm/dd/yyyy format, then create new Date and save. Since the iteration takes too much time

Problem With The SQOOP Imported Data In HDFS

Hello Everyone, When I tried to import the below data from an Oracle table (columns delimited by ',') to HDFS using the below mentioned Sqoop command, 12345,1-1SKCE5P,null,2013-10-11 06:23:22.0,2014-12-02 14:22:32.0,Switched "INFONET CONFERENCING" GSP P3519,null,OS sqoop import --connect jdbc:oracle:thin:@//xxxxx:xxxxx/xxxx_xxxx --username SUMAN --password-file /user/$USER/sqoop.password --

Get Only The Date From DateType Data

This is the CREATED_TIME *2009-12-14 10:15:54* * * How I can get only the date part from the above created_time, just like below. *2009-12-14* Any suggestions will be appreciated. *Raihan Jamal*

How To Import Data With A Different Date Format

Hi, I am attempting to import some of our data into SOLR. I did it the quickest way I know because I literally only have 2 days to import the data and do some queries for a proof-of-concept. So I have this data in XML format and I wrote a short XSLT script to convert it to the format in solr/example/exampledocs (except I retained the element names so I had to modify schema.xml in the conf directory

Newbie - Hive Tutorial Question, What Format Is The Sample Data File In?

I'm going through the tutorial at https://cwiki.apache.org/Hive/tutorial.html . It's not clear to me what the exact format of the log file would be for the sample queries described eg at https://cwiki.apache.org/Hive/tutorial.html#Tutorial-LoadingData I can't find a link to download such a file and while I'd be happy to construct one myself it's not clear to me what a viewTime of type INT would

Why CQL Returns Data In Byte Format, While Hive De-serialize And Return The Data In Readable Format

Hi, We are using same underlying column family and extract the data using Hive query and CQL query. Column family meta data contains Comparator='IntegerType' and default_validation_class = FloatType. CREATE COLUMN FAMILY cpu_avg_5min WITH COMPARATOR = 'IntegerType' AND key_validation_class = UTF8Type AND default_validation_class = FloatType; Queering through Hive using a Hive query returns readable

RuntimeException: Invalid Version (expected 2, But 60) Or The Data In Not In 'javabin' Format

Hi, I'm using Nutch 2.1 (Inside Eclipse) + Solr 4.0.0 with schema-solr4.xml. The run configuration in eclipse is: org.apache.nutch.crawl.Crawler urls -solr http://localhost:8080/solr/#/collection2 -threads 1 -depth 1 -topN 3 -Dhadoop.log.dir=logs -Dhadoop.log.file=hadoop.log Rarely, it works fine, but most time there's an exception in console: Adding 1 documents Exception in thread "main" java.lang

Invalid Version Or The Data In Not In 'javabin' Format

hi... currently i am integrating nutch (release 1.2) into solr (trunk). if i indexing to solr index with nutch i got the exception: java.lang.RuntimeException: Invalid version or the data in not in 'javabin' format at org.apache.solr.common.util.JavaBinCodec.unmarshal(JavaBinCodec.java:99) at org.apache.solr.client.solrj.impl.BinaryResponseParser.processResponse(BinaryResponseParser

Data In Unicode-Format

We are looking for databases with Unicode implementation. In the TODO list there is an entry "Add support for UNICODE". When will this feature be available? Display of Chinese Characters GmbH Tel : +49 511 / 9357-810 Sven Just Fax : +49 511 / 9357-819 Vahrenwalder Str. 7 WWW : http://www.dcc-asia.de 30165 Hannover, Germany

What Is The Format Of Data Contained In A Named List?

A named list contains a key value pair. At the very basic level, if we want to access the data that is contained in named list NamedList foo = thisIsSolrQueryResponseObject.getValues(); Entry bar = null; // Creating a iterator to iterate through the response Iterator It =foo.iterator(); while (It.hasNext()) { bar = It.next(); SolrDocumentList solDocLst = (SolrDocumentList) bar.getValue(); for (int

How To Filter Data In Ng-grid On The Basis Of Date Range

Using $scope.gridOptions.filterOptions.filterText i am able to filter my data. For example to filter on the basis of Destination Country, i am setting the filterText as: $scope.gridOptions.filterOptions.filterText += 'DestinationCountry:' + $scope.filter.DestinationCountry + ';'; However i am unable to figure out how to filter a date column on the basis of date range i.e. between from-date and

Invalid Version (expected 2, But 60) Or The Data In Not In 'javabin' Format

I am using Solr 4.3.1 on solrcloud with 10 nodes. I added 3 million documents from a csv file with this command curl 'http://localhost:8080/solr/trcollection2/update/csv?stream.file=/home/hduser/csvFile.csv&skipLines=1&fieldnames=,cache,segment,digest,tstamp,lang,url,,content,id,title,boost&stream.contentType=text/p lain;charset=utf-8' Then I query the data, fetching first 100K documents

FW: What Is The Format Of Data Contained In A Named List?

Hi, Thanks for your reply. But I need one clarification. When you say it will contain the data you requested for, do you mean the data as requested in fl parameter of the query? Thanks. Aman

Create Table / Data Type Syntax For Csv Files With Comma In The Column

Hello, I have a csv file that has columns which contains commas within a string enclosed with a ". ex: column name:*'Issue' *value:*"Other (phone, health club, etc)"* *Question:* What should the data type of 'Issue' be? Or how should I format the table (row format delimited terminated by) so that the comma in the column (issue) is accounted for correctly I had set it as below but this

Sqoop Modifies The DATE Format In The Exported Data

Related discussions

Sqoop-commits

Sqoop-dev

Sqoop-user