Hi,
I am trying to load some data from ORC files in AWS S3 into MemSQL through a Pipeline.
The steps I follow:
- save the ORC data as Parquet in another bucket
- load the Parquet data into MemSQL through a pipeline
The issue I am facing is that decimal values don’t seem to be getting inserted properly.
The source ORC columns are defined as decimal types in Athena, e.g. decimal(18,4), decimal(19,6), …
I have defined those columns as decimal types with the same precision and scale in the MemSQL table as well.
However, the values end up incorrect in MemSQL. For example, the value 10400.0000 (decimal(18,4)) in Athena is inserted as 104000000 in MemSQL, although the MemSQL column is also defined as decimal(18,4).
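For what it's worth, the bad value looks like the original number with its scale dropped: 10400.0000 at scale 4 corresponds to the unscaled integer 104000000. Below is a minimal Python sketch of that arithmetic. The helper name `unscaled_to_decimal` is my own, and the idea that the pipeline is reading the raw unscaled integer from the Parquet file is only an assumption on my part, not something I have confirmed.

```python
from decimal import Decimal


def unscaled_to_decimal(unscaled: int, scale: int) -> Decimal:
    """Interpret an unscaled integer as a decimal with the given scale.

    Hypothetical helper for illustration only; it is not part of any
    MemSQL or Parquet API.
    """
    # Shift the decimal point `scale` places to the left.
    return Decimal(unscaled).scaleb(-scale)


# The value I see in MemSQL, reinterpreted with the column's scale of 4.
print(unscaled_to_decimal(104000000, 4))
```

If that assumption holds, the data itself is intact and only the scale is being lost somewhere between the Parquet file and the table.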
Also, if I change the MemSQL column to varchar to see exactly what is coming in, then some decimal columns (for example 35.000000, defined as decimal(19,6) in ORC/Athena) are inserted into MemSQL as non-human-readable characters, as shown below: