User Activity

  • Posted a comment on discussion Help on Open Source Data Quality and Profiling

    Hi, First decide which data quality dimensions you are looking for, such as completeness, accuracy, validity, and uniqueness. Then apply the quality rules for each dimension. For completeness, check for null and empty columns; for uniqueness, check for duplicates (de-dup). Hope it helps. On Wed, Feb 1, 2023 at 3:26 AM Mazen Saie mazen-saie@users.sourceforge.net wrote: Hope you are doing well, I'm in the process of implementing a data quality project, and one of the current phase's deliverables is data quality dimensions, the...
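The two rules named in the comment above (a null/empty check for completeness and a duplicate check for uniqueness) can be sketched roughly as follows. This is a minimal illustration in plain Python, not osDQ's own implementation; the function names `completeness` and `duplicates` and the sample `records` are hypothetical.

```python
from collections import Counter


def _is_filled(value):
    """True when a value is neither None nor a blank string."""
    return value is not None and str(value).strip() != ""


def completeness(rows, column):
    """Fraction of rows whose `column` holds a non-null, non-empty value."""
    if not rows:
        return 1.0  # assumption: an empty data set counts as complete
    filled = sum(1 for row in rows if _is_filled(row.get(column)))
    return filled / len(rows)


def duplicates(rows, column):
    """Map each value that occurs more than once in `column` to its count."""
    counts = Counter(
        row.get(column) for row in rows if _is_filled(row.get(column))
    )
    return {value: n for value, n in counts.items() if n > 1}


# Hypothetical sample records, for illustration only
records = [
    {"id": 1, "email": "a@example.com"},
    {"id": 2, "email": ""},
    {"id": 3, "email": None},
    {"id": 4, "email": "a@example.com"},
]

print(completeness(records, "email"))
print(duplicates(records, "email"))
```

Blank and null values are left out of the duplicate count here so that missing data shows up only under completeness; whether that separation suits a given project is a design choice.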

  • Posted a comment on discussion Help on Open Source Data Quality and Profiling

    Here are links to videos that explain some features of osDQ. Documents on UDF and scheduling are also attached. https://www.youtube.com/watch?v=_Allh9Uraoo&t=59s https://www.youtube.com/watch?v=_lvcaj8rPb8&t=12s https://www.youtube.com/watch?v=MQMoZeUjJRw&t=56s https://www.youtube.com/watch?v=q4PvY3ty880&t=2s

  • Posted a comment on discussion Help on Open Source Data Quality and Profiling

    8 GB looks too small for a bigger file, because memory is also needed for other components such as Swing and AWT and for local variables. You need more RAM. The core component of osDQ is at https://github.com/arrahtech/osdq-core, though you will have to write your own driver. Regards. ++++++ I bumped up the RAM to 8 GB (Xmx, Xms) (see below). I have a file that's around 5.6 GB. On loading the file, I get the following error. Any ideas on how to handle files up to 8 GB? Is it possible to invoke the profile at the command...

  • Modified ticket #80 on Open Source Data Quality and Profiling

    Need to keep the password safe

  • Posted a comment on ticket #80 on Open Source Data Quality and Profiling

    closing it

  • Posted a comment on ticket #80 on Open Source Data Quality and Profiling

    In release 6.3.1 the password is masked with *.

  • Posted a comment on discussion Help on Open Source Data Quality and Profiling

    See https://stackoverflow.com/questions/1565388/increase-heap-size-in-java and make the heap-size changes in runprofiler.[sh][bat].

  • Posted a comment on discussion Help on Open Source Data Quality and Profiling

    Loading is slow because Snowflake is a cloud database. You have to be co-located to get faster access. These URLs discuss the above error: https://support.snowflake.net/s/question/0D50Z00008TSnlZSAT/jdbc-fetching-query-result-failed-with-the-target-server-failed-to-respond https://discourse.metabase.com/t/metabase-failing-to-fetch-large-dataset/5789/2 It seems to be a large data set, so you need to increase the heap space in "C:\Users\przibylla\Desktop\ProfilerV6.2.9\ProfilerV6.2.9>java -Xmx4096M -Xms4096M...


Personal Data

Username:
arrah
Joined:
2006-08-16 10:54:54

Projects

This is a list of open source software projects that arrah is associated with:
