dar-libdar_api Mailing List for DAR - Disk ARchive
For full, incremental, compressed and encrypted backups or archives
Brought to you by: edrusb
Messages by month:

| Year | Jan | Feb | Mar | Apr | May | Jun | Jul | Aug | Sep | Oct | Nov | Dec |
|------|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|
| 2003 |     |     |     |     |     |     | 1   | 1   | 1   |     |     |     |
| 2004 |     |     |     | 2   |     | 22  | 14  |     | 3   | 3   | 22  | 3   |
| 2005 | 3   |     | 9   |     | 1   |     | 2   | 5   | 2   | 1   |     | 1   |
| 2006 |     |     | 2   |     |     |     | 2   |     |     | 2   |     |     |
| 2007 |     | 5   | 3   | 10  | 12  | 4   |     |     |     |     |     | 3   |
| 2008 |     |     | 8   | 9   |     |     |     |     |     |     |     |     |
| 2009 | 6   |     |     |     |     |     |     |     | 4   |     |     |     |
| 2010 | 2   |     |     |     |     |     |     |     |     | 1   |     |     |
| 2013 |     |     |     |     |     |     |     |     |     | 13  | 1   |     |
| 2014 |     | 1   | 3   |     |     |     |     |     | 6   | 5   |     |     |
| 2016 | 3   |     |     |     |     |     |     |     |     |     |     |     |
| 2017 |     |     |     |     |     |     |     |     | 1   |     |     |     |
| 2018 |     |     |     |     |     |     |     | 10  |     |     |     |     |
From: Denis C. <dar...@fr...> - 2018-08-18 13:52:43
Hi,

just to let you know that the pre-release phase has started:
http://dar.linux.free.fr/pre-release/

Cheers,
Denis
From: Tobias S. <spe...@gm...> - 2018-08-10 14:07:07
Hi Denis,

this sounds like a big step for the API! I will check it out and see where it affects Gdar. Thank you for your offer; I will come back to you if something is unclear :) A Python 3 API would be awesome.

Best regards,
Tobias

On 10 August 2018 13:49:08 CEST, Denis Corbin <dar...@fr...> wrote:
> [...]
From: Tobias S. <spe...@gm...> - 2018-08-10 13:50:59
Hi Dennis,

learning by example is always best. Learning and making life easier was also my motivation for writing Gdar. I haven't looked at dar_manager in detail yet, but decent versioning of archived files is great. It would be nice to have a GUI there :)

I hope Gdar worked for you; I haven't had much time in between to work on it. Yes, my backup solution would also be based on libdar.

The inhibit-shutdown topic is unfortunately not a simple one. A solution covering all desktop environments and distributions would of course be preferable, but I also want a convenient user experience. I would be happy with a nicely fitting solution for KDE and GNOME, with a fallback for all other environments. Overriding the close event, the way editors do it, works well, but it requires a program window; as far as I know it does not work for a background process without keeping a permanent window around. A solution based on systemd would be nice, but the inhibit only works if the shutdown is triggered directly via systemd: I could not get it working with the shutdown command, or when the shutdown is initiated via the desktop. Do you see my point? But this discussion is nothing specific to the libdar API :)

Best regards,
Tobias

On 10 August 2018 12:33:21 CEST, Dennis Katsonis <de...@ne...> wrote:
> [...]
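To make the systemd-inhibit idea concrete, here is a minimal sketch; the dar options and archive path are illustrative, and, as noted above, the lock is only honored when the shutdown request actually goes through systemd-logind:

```sh
# Take a block-mode inhibitor lock for the duration of the backup;
# shutdown/sleep requests routed through systemd-logind are refused
# until dar exits. Archive path and dar options are examples only.
systemd-inhibit \
    --what=shutdown:sleep \
    --why="dar backup in progress" \
    --mode=block \
    dar -c /backups/root_full -R / -z
```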
From: Denis C. <dar...@fr...> - 2018-08-10 12:09:19
On 10/08/2018 12:00, Dennis Katsonis wrote:
[...]
> I am still in two minds about it. [...] I'm not sure whether this
> should be the separate project I've started, or a contribution to
> KDar which includes dar_database style management functionality.

I can't tell you what the best direction is here, but I do remember that after Johnathan K. Burchill developed kdar (started in 2003) there were some issues with a new KDE version a few years later (I guess it was KDE 4)... issues that have not been solved AFAIK (correct me if I'm wrong).

> I should give my thanks to you for creating dar [...]

Thanks!

[...]

> Thank you for the clarification. I didn't do the math over archives
> of different sizes, and went by my initial impression.

What I described is the theory. If the system starts swapping, performance degrades even faster as the archive's memory requirement increases, of course...

> In the FAQ about a 'slight' penalty.
>
> http://dar.linux.free.fr/doc/FAQ.html
>
> Under "What slice size can I use with dar?" it says "thanks to its
> internal own integer type named "infinint" dar is able to handle
> arbitrarily large integers. This has a slightly memory and CPU
> penalty in regard to using native computer 32 or 64 bits integers,
> but has the advantage to provide a long term implementation in dar."

I will fix that, this is not correct, thanks for the feedback.

[...]

> My first impression was a bug [...] I was a little concerned it
> might turn people off using dar if they find the program seems to
> hang for 10 minutes or so while restoring a single file.

I will probably follow your suggestion for release 2.6.0...

> I'm not sure under what circumstances anyone would reach the stated
> limits though, unless I'm reading the website wrong and the file
> size limits are not 18EB but smaller.

You are reading it correctly: the limit will be reached on any system once the archive size (even split into many smaller slices) reaches 18 EB. I got positive feedback about archives of several hundred TB, and I think some other people reached the petabyte range for their needs some years ago, so we are still far from the 18 exabytes... :-)

[...]

Cheers,
Denis
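For context, the 18 EB figure discussed in this exchange is simply the range of an unsigned 64-bit integer:

$$2^{64} \text{ bytes} = 18\,446\,744\,073\,709\,551\,616 \text{ bytes} \approx 18.4\ \text{EB} = 16\ \text{EiB}$$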
From: Denis C. <dar...@fr...> - 2018-08-10 11:49:17
On 10/08/2018 09:13, Tobias Specht wrote:
[...]
> Hi Denis :)

Hi Tobias! :)

> nice to hear you are improving the libdar API. Will have a look at
> it.

Sure. So far all the new API information is available in GIT master; the doc/API_tutorial.html is up to date, for example. I have removed all the pure C calls and replaced arguments with standard types as much as possible. In parallel, I also plan to make a C binding for those that do not want exceptions and classes... and I am also planning a Python 3 binding, a bit like what Wesley Leggette did some years ago with Python 2, but due to the long-awaited 2.6.0 release these bindings will come right after this new major release.

If you need any help getting gdar working with the upcoming 2.6.0 release, tell me; I can provide support, of course!

> Best regards, Tobias

Best regards,
Denis
From: Dennis K. <de...@ne...> - 2018-08-10 10:42:13
On 08/10/2018 05:13 PM, Tobias Specht wrote:
> nice to hear you are planning to write a backup application with
> libdar. I'm using the libdar API with my small tool Gdar myself:
> https://github.com/peckto/gdar
> http://www.peckto.de/gdar/gdar (website currently having some
> linking problems...)
> It's GTK though. Gdar can only extract dar archives at the moment.
> Feel free to work with my code, or use it as a libdar example :)
>
> I'm planning a larger tool with automated backup too.
> But I'm still in the planning phase...
> Trying to solve some general problems with backups in desktop
> environments, like inhibit/delay shutdown:
> https://forum.kde.org/viewtopic.php?f=305&t=141575
> If you have an idea on it, let me know.

Thank you. Even though I've started, I'm still thinking it might be better to work with an existing program than create yet another one. I'm a self-taught hobby programmer, and part of the motive is simply having something perhaps worthwhile to work on. It's actually dar_manager and the database which interest me more, and allowing the user to easily see which versions of which files are in their backup, and restore them. Restoration from backups is more often about recovering a small number of accidentally deleted or overwritten files, or seeing a file as it was some time ago, than about full system restores.

I played around with GDar a couple of years ago or so.

Is this tool you are working on going to be libdar based?

As for the shutdown, I don't know how to do it under KDE, and I don't think you can (with good reason). The answer provided in that thread, where I presume the main window overrides the closeEvent slot to at least give a warning, is probably the best one.

The problem is, even if it were possible, it would only work under KDE. Run the software under FVWM, or Fluxbox, or something else, and it won't inhibit closure of the windowing system. This would lead to users having incomplete backups, possibly without knowing it. It also doesn't prevent shutdown by another logged-in user, or by root.

I would use systemd-inhibit. See the second answer here:
https://unix.stackexchange.com/questions/34489/how-to-disable-shutdown-so-that-an-important-process-cannot-be-interrupted#264745

> About your initial problem, I'm using Fedora and dar as an rpm
> myself. Did you write to the package maintainer about the compiler
> flag? Maybe he can add it to the build instructions.

Yes, I did make the suggestion.

[...]
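As a concrete illustration of that dar_manager-centric workflow, here is a hedged sketch; the archive and database names are invented for the example, while the options themselves are the documented dar/dar_manager ones:

```sh
# Full then differential backup of /home (example paths)
dar -c /backups/home_full -R /home -z
dar -c /backups/home_diff1 -R /home -z -A /backups/home_full

# Track both archives in a dar_manager database
dar_manager -C /backups/home.dmd
dar_manager -B /backups/home.dmd -A /backups/home_full
dar_manager -B /backups/home.dmd -A /backups/home_diff1

# See which archives hold versions of a given file, then restore
# the most recent version from whichever archive contains it
dar_manager -B /backups/home.dmd -f user/notes.txt
dar_manager -B /backups/home.dmd -r user/notes.txt
```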
From: Dennis K. <de...@ne...> - 2018-08-10 10:09:27
On 08/10/2018 06:29 AM, Denis Corbin wrote:
> nice! :)

I am still in two minds about it. It is intended to be backup-focused, for those who like manual, simple backups, but with more emphasis on making restoration and viewing and managing the state of the backups easy and visible. I'm not sure whether this should be the separate project I've started, or a contribution to KDar which includes dar_database style management functionality.

I should give my thanks to you for creating dar, as I was looking for a replacement for dump/restore and it met all my needs: simple, easy differential backups, ad-hoc backups, backups in manageable file archives, encryption, and the ability to reliably save ALL the file attributes easily. It is a software package where you can tell the author has given a lot of thought to how it might be used and what people might want to do, and has accommodated and documented that well.

> the memory requirement is not exponential but proportional to the
> number of files saved. The CPU requirement is roughly proportional
> to the volume of data to treat [...]

Thank you for the clarification. I didn't do the math over archives of different sizes, and went by my initial impression.

> Where have you read that? This should be an error to be fixed.

In the FAQ, about a 'slight' penalty:

http://dar.linux.free.fr/doc/FAQ.html

Under "What slice size can I use with dar?" it says "thanks to its internal own integer type named "infinint" dar is able to handle arbitrarily large integers. This has a slightly memory and CPU penalty in regard to using native computer 32 or 64 bits integers, but has the advantage to provide a long term implementation in dar."

> Well, that's correct...

My first impression was a bug, and I was looking to migrate away from using dar until I did some web searching and thought that maybe the integer type was more significant than it might appear. I was a little concerned it might turn people off using dar if they find the program seems to hang for 10 minutes or so while restoring a single file.

I'm not sure under what circumstances anyone would reach the stated limits though, unless I'm reading the website wrong and the file size limits are not 18EB but smaller.

[...]
From: Tobias S. <spe...@gm...> - 2018-08-10 07:13:44
Hi Dennis,

nice to hear you are planning to write a backup application with libdar. I'm using the libdar API with my small tool Gdar myself:

https://github.com/peckto/gdar
http://www.peckto.de/gdar/gdar (website currently having some linking problems...)

It's GTK though, and Gdar can only extract dar archives at the moment. Feel free to work with my code, or use it as a libdar example :)

I'm planning a larger tool with automated backup too, but I'm still in the planning phase... trying to solve some general problems with backups in desktop environments, like inhibiting/delaying shutdown:

https://forum.kde.org/viewtopic.php?f=305&t=141575

If you have an idea on it, let me know.

About your initial problem: I'm using Fedora and dar as an rpm myself. Did you write to the package maintainer about the compiler flag? Maybe he can add it to the build instructions.

Hi Denis :)

nice to hear you are improving the libdar API. Will have a look at it.

Best regards,
Tobias

On Thursday, 9 August 2018, 22:29:11 CEST, Denis Corbin wrote:
> [...]
From: Denis C. <dar...@fr...> - 2018-08-09 20:29:27
On 09/08/2018 13:37, Dennis Katsonis wrote:
> Hello,

Hello Dennis,

> I am developing a front end for Dar which is intended not just to
> provide a graphical way of creating archives, but also to provide
> basic backup management. The application will be written using the
> Qt toolkit and using libdar directly.

nice! :)

Be aware that the upcoming major release 2.6.0 brings some API redesign to simplify its use (fewer libdar-specific auxiliary types) and adds new features. The same old API will still be available in the dedicated 'libdar5' namespace, though, and I will be available to help you migrate to API v6 upon request.

> I note that the version of dar compiled for Fedora uses the
> unlimited integer size. The performance of dar on archives with
> large numbers of files is not satisfactory [...]

this is a known limitation of the 'infinint' dar/libdar flavor:
http://dar.linux.free.fr/doc/Limitations.html

[...]

> For smaller archives, the difference is less noticeable. It seems
> that dar operations increase exponentially in CPU time as the
> number of files increases.

the memory requirement is not exponential but proportional to the number of files saved. The CPU requirement is roughly proportional to the volume of data to treat (CRC computation, compression, encryption, ...). This is true for both the 64-bit and infinint flavors, though the infinint flavor does not rely on CPU integer operations, hence its slowness.

> The dar website seems to suggest that the cost of infinint is
> modest, but my testing indicates that for what would be a regular
> backup scenario, the cost is high.

Where have you read that? That would be an error to be fixed.

> I suggest that infinint as an integer type should not be the
> default.

... that's to be considered, though there is a warning at compilation time when you compile using infinint... Thus, if the person compiling does not even read that warning, he will neither read the documentation nor the limitations, and will blindly complain about any problem he meets. Such people drain a lot of time and are always unsatisfied in the end... so it's usually a good thing for me that they do not use dar; it saves me time for more interesting things than trying to justify and explain...

> It adds in some cases unacceptable costs for no practical gain.
> While some distributors compile with 64 bit integers (MacOSX brew),
> others use the default (Fedora), which leads to a dar binary which
> people may consider broken or buggy.

Well, that's correct...

> My other question is that the API uses infinint for values
> internally. How does a libdar compiled with 64 bit integers impact
> what is returned from methods returning an infinint?

The infinint and 64-bit flavors differ only in the way the "infinint" class is implemented. infinint is an alias (a typedef, if you prefer) for either "class real_infinint" or "class limitint" (with a 32- or 64-bit integer underneath). Both classes have the same interface toward the rest of libdar; only their implementations differ.

There is still an infinint class exposed up at the API level... In APIv6 I've pushed away a lot of internal types (including infinint) using the pimpl idiom for some classes, but it was too complicated, or would have hurt performance, to do that for all API-related classes... Thus the API remains indirectly dependent on whichever of real_infinint/limitint is used in libdar. In other words, if your program has been dynamically linked with libdar64, it won't be possible to dynamically link it with libdar (relying on infinint), at least today. I have not done the test with APIv6, but I'm pretty sure it won't work.

> I plan to possibly use a linked-in libdar compiled with 64 bit
> integers to ensure good performance. Does infinint convert
> internally from a native 64 bit to an infinint type?

Not exactly. Both classes (limitint and real_infinint) do the same thing; in particular, they store/read integers into a dar archive the same way, so the resulting archive is the same. If an integer is too large to be handled by class limitint, the class will detect the overflow during an arithmetic operation, or while reading the integer from an archive, and libdar will abort with an Elimitint exception.

Historically libdar relied on real_infinint (the class was simply named infinint at that time), but due to poor performance the limitint class was created and substituted for class infinint (now renamed real_infinint). Internally, dar does not directly manipulate 64-bit integers for dates, sizes, offsets or anything else, except when dealing with system and library calls, where the "infinint" class can convert from and to the classical integer types like size_t and the like.

So if your plan is to statically link your program with libdar64, there is no issue; it will work flawlessly.

> Thanks, Dennis

Cheers,
Denis
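To make the linking point concrete, here is a heavily hedged sketch of statically linking a privately built 64-bit flavor. The install prefix, the dependency libraries, and the C++ standard flag all depend on how libdar was configured locally, so treat every path and -l flag below as an assumption; only the libdar64 library name itself comes from the thread:

```sh
# Link a frontend against a private static libdar64 installed under
# $HOME/opt/dar64 (illustrative prefix); the "64" suffix comes from
# building libdar with --enable-mode=64. The trailing -l flags stand
# in for whatever compression/crypto libraries the local build used.
g++ -std=c++14 -o myfrontend main.cpp \
    -I"$HOME/opt/dar64/include" \
    "$HOME/opt/dar64/lib/libdar64.a" \
    -lz -lbz2 -lgcrypt -lgpg-error -lpthread
```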
From: Dennis K. <de...@ne...> - 2018-08-09 12:07:27
|
Hello,

I am developing a front end for Dar which is intended not just to provide a graphical way of creating archives, but also to provide basic backup management. The application will be written using the Qt toolkit, using libdar directly.

I note that the version of dar compiled for Fedora uses the unlimited integer size. The performance of dar on archives with large numbers of files is not satisfactory, and would unfortunately also mean that the graphical application would stall and delay. The following command on an archive containing about 1 million files takes 10 minutes:

$ time /usr/bin/dar -l root > /dev/null
/usr/bin/dar -l root > /dev/null  615.30s user 1.81s system 107% cpu 9:31.64 total

Memory usage peaks at 2124MB. This delay is seen when listing, when scanning a reference archive while creating a differential backup, and when adding the archive to a dar_manager database. It also causes a delay when extracting a file, which kind of defeats the purpose of having random access to files; it would probably take as long to extract a file from a compressed tarball. It also means that dar cannot complete a backup of my root directory on my laptop with 2G of RAM.

I compiled dar 2.5.16 with the --enable-mode=64 option, and the performance greatly increased. For the exact same archive, using 64 bit integers:

$ time /usr/bin/dar -l root > /dev/null
dar -l root > /dev/null  28.89s user 0.48s system 97% cpu 30.253 total

A 20x speed increase. Memory usage peaked at 879MB, still high, but far better. dar_manager operations were faster, but still slow. It seems that dar operations increase exponentially in CPU time as the number of files increases. For smaller archives, the difference was less noticeable, but still there.

The dar website seems to suggest that the cost of infinint is modest, but my testing indicates that for what would be a regular backup scenario, the cost is high. Looking at the page listing the limitations, the limits of 64 bit integers seem to far, far exceed what is required, what technology today can support, and likely what technology for many years to come can support. I suggest that infinint as an integer type should not be the default. It adds in some cases unacceptable costs for no practical gain. While some distributors compile with 64 bit integers (MacOSX brew), others use the default (Fedora), which leads to a dar binary that people may consider broken or buggy.

My other question is that the API uses infinint for values internally. How does a libdar compiled with 64 bit integers impact what is returned from methods returning an infinint? I plan to possibly use a linked-in libdar compiled with 64 bit integers to ensure good performance. Does infinint convert internally from a native 64 bit to an infinint type?

Thanks,
Dennis |
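The build-and-measure procedure described above boils down to the following steps. This is a sketch only: installation paths and the archive name are examples, not prescriptions.

./configure --enable-mode=64   # build with 64 bit limitint instead of infinint
make
sudo make install

# compare listing time against an infinint build of the same archive
time dar -l root > /dev/null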
From: Denis C. <dar...@fr...> - 2017-09-09 11:56:34
|
Hi all,

this mailing-list is hosted at SourceForge, where the policy has recently changed: subscribed users needed to manually resubscribe before August in order not to be removed from the mailing-list. I have been removed myself, so I may have missed some support requests since July. Well, it seems I missed, or did not pay attention to, the notice sent by SourceForge about that new policy...

... anyway, this mail has two main purposes:
- adding a trace in the mailing-list archive, just in case
- and second, checking that the mailing-list to newsgroup gateway at gmane.org is still operational.

Sorry for the inconvenience.

Cheers,
Denis |
From: Denis C. <dar...@fr...> - 2016-01-09 18:32:40
|
Hi Tobias,

Yep, I missed it, sorry. This is now fixed in GIT and ready for the next release. Thanks for your feedback.

Regards,
Denis.

On 08/01/2016 21:38, Tobias Specht wrote:
> Hi Denis,
>
> since dar version 2.5.1, up to and including 2.5.3, I'm experiencing problems including libdar in my project gdar. The compiler raises the following error:
>
> In file included from /usr/include/dar/storage.hpp:29:0,
>   from /usr/include/dar/real_infinint.hpp:43,
>   from /usr/include/dar/infinint.hpp:31,
>   from /usr/include/dar/compressor.hpp:31,
>   from /usr/include/dar/libdar.hpp:77,
>   from mylibdar.hpp:26, from gdar.cpp:22:
> /usr/include/dar/on_pool.hpp:37:45: fatal error: /usr/include/dar/cygwin_adapt.hpp: No such file or directory
>   #include "/usr/include/dar/cygwin_adapt.hpp"
>
> In dar version 2.5.3, cat_tools.hpp seems also to be missing.
>
> The missing header files are part of the release but are not copied during installation.
>
> I compiled dar from source like:
> ./configure --prefix=/usr
> make
> make install
>
> The system I'm using is LinuxMint 17.3, but the problem probably occurs on other systems as well.
>
> The compiler options when including libdar are: `pkg-config --cflags libdar`
>
> Am I missing some compiler or config options, or are the files just missing?
>
> Best regards, Tobias |
From: Tobias S. <spe...@gm...> - 2016-01-08 20:38:44
|
Hi Denis,

since dar version 2.5.1, up to and including 2.5.3, I'm experiencing problems including libdar in my project gdar. The compiler raises the following error:

In file included from /usr/include/dar/storage.hpp:29:0,
  from /usr/include/dar/real_infinint.hpp:43,
  from /usr/include/dar/infinint.hpp:31,
  from /usr/include/dar/compressor.hpp:31,
  from /usr/include/dar/libdar.hpp:77,
  from mylibdar.hpp:26, from gdar.cpp:22:
/usr/include/dar/on_pool.hpp:37:45: fatal error: /usr/include/dar/cygwin_adapt.hpp: No such file or directory
  #include "/usr/include/dar/cygwin_adapt.hpp"

In dar version 2.5.3, cat_tools.hpp seems also to be missing. The missing header files are part of the release but are not copied during installation.

I compiled dar from source like:
./configure --prefix=/usr
make
make install

The system I'm using is LinuxMint 17.3, but the problem probably occurs on other systems as well. The compiler options when including libdar are: `pkg-config --cflags libdar`

Am I missing some compiler or config options, or are the files just missing?

Best regards,
Tobias |
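For context, a compile invocation of the kind described above would look like the following. This is a single-file sketch; the real gdar build involves more sources and build-system glue.

g++ -o gdar gdar.cpp $(pkg-config --cflags --libs libdar)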
From: Tobias S. <spe...@gm...> - 2014-10-12 23:07:59
|
Hi Denis,

thank you very much for the official feature request. I know it takes time to implement such a complex feature, and you have other open points too. If I can help you at some point, please let me know.

About the hashes: one salt will be used for one archive. An attacker shouldn't be able to use one hash twice (on two different archives); this is the purpose of the salt. The salt does not make the hash more secure in general, because it is public. It has to be stored beside the hash table, outside the encrypted area of the archive. The only attack I can think of is the situation where the attacker can guess the path and file name. Then he could prove that one specific file is in the archive. To prevent this case I added the other values:

H(path+filename + inodeID + mtime + UUID + salt)

The idea is that the inodeID + mtime + UUID alone provide enough entropy to make it inefficient to crack the hash. The time an attacker would need to do so can be calculated as follows. We assume:
* the path and file name can be guessed
* the attacker doesn't have to try more than 100000 inodeIDs
* mtime can be limited to within one day
* a 32Bit UUID (meaning 32 bits that can't be guessed)
* 1 hash round (if it doesn't bother you we can go up to 100 rounds)
* a GPU cluster (http://hashcat.net/oclhashcat/#performance)

The possibilities that have to be brute-forced are: 100000*60*60*24*2^32. When we assume the attacker can generate 2005M c/s, he will need:
((100000*2^32*60*60*24) / (2005*10^6))[s] ~ 580 years
I guess this is quite too long to wait.

I have created a demo application for the hashed catalogue: https://github.com/peckto/hash_dic_test
It creates the hashes as discussed (1*SHA3-512) and stores them, plus the corresponding data, in an std::unordered_map. Afterwards it iterates again through the file system and looks up every hash in the map. It turns out that the most time consuming part is the hashing process. On my notebook it looks like this:

# ./hash_dic_test /
generate hashes
duration: 0:6:986
---------------------------------
build hash table
duration: 0:7:544
---------------------------------
search in hash table
cant find hash! /var/log/journal/ad1d17f14aee4b34a7e6c6a3689ac394/system.journal|655806|1413144061|fdf7c30d-838c-4e16-af37-2a345650590a
duration: 0:7:563
---------------------------------
map entries: 495.425
map size: ~99,085MB

You can see that files that have changed since the hash table was created are not found in the table.

About the performance: it takes about 7s to iterate through the file system and to calculate the hashes. In the second stage the same process is done again, but the hashes and the related data are stored in the map. In the last stage the hashes are generated again and are looked up in the map. In reality the hash must only be calculated once to perform the two different tasks (write the hash to the new table and look it up in the table of reference). Both tasks would need about one second in total (for 500000 files). You can try it on your own system as well. Do you have any concerns regarding the performance?

How flexible is your archive format? Is it possible to get just some space (e.g. 100MB) to store any sort of binary data? Are there any limitations? I'm still working on a method to store the dictionary inside another file (the dar archive). I'm looking at Berkeley DB; it could also replace the whole map structure. It is also possible to just serialize the map (e.g. with Boost), but this is not very portable.
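The core of the demo above can be sketched as follows. This is illustrative only: the struct and function names are made up, and std::hash stands in for the SHA3-512 used by hash_dic_test, since a real cryptographic hash would require an external library.

#include <cstdint>
#include <functional>
#include <sstream>
#include <string>
#include <unordered_map>

// Hypothetical metadata record stored against each hash.
struct entry_meta
{
    std::uint32_t uid, gid, perm;
    std::uint64_t file_size;
    bool is_dir;
    std::int64_t ctime;
};

// Placeholder for a cryptographic hash: std::hash is NOT cryptographic
// and only keeps this sketch self-contained.
static std::string crypto_hash(const std::string & msg)
{
    return std::to_string(std::hash<std::string>{}(msg));
}

// Build the dictionary key as described in the thread:
// H(path+filename + inodeID + mtime + UUID + salt)
std::string make_key(const std::string & path, std::uint64_t inode,
                     std::int64_t mtime, const std::string & uuid,
                     const std::string & salt)
{
    std::ostringstream s;
    s << path << '|' << inode << '|' << mtime << '|' << uuid << '|' << salt;
    return crypto_hash(s.str());
}

int main()
{
    std::unordered_map<std::string, entry_meta> dict;
    const std::string salt = "per-archive-random-salt"; // one salt per archive, stored in clear

    dict[make_key("/home/tobias/Documents/test.txt", 655806, 1413144061,
                  "fdf7c30d-838c-4e16-af37-2a345650590a", salt)]
        = entry_meta{1000, 1000, 0644, 4096, false, 1413144061};

    // Differential backup: recompute the key from the live filesystem and
    // look it up; a miss means the file is new or has changed.
    return 0;
}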
Regards,
Tobias

On Tuesday, 07.10.2014 at 21:26 +0200, Denis Corbin wrote:
> On 07/10/2014 17:41, Tobias Specht wrote:
> > Hi Denis,
> Hi Tobias,
> > maybe I was not exact enough about what I want to hash and how the dictionary is organized: * I don't want to hash the content of the file
> This I understood,
> > * the dictionary is not organized hierarchically, as your catalogue is
> This I didn't, but OK; that does not change the picture much, and it makes sense to avoid exposing the directory tree structure.
> > * when I'm talking about filename I mean path + file name
> OK,
> > * the dictionary does not replace the catalogue, it is just an extra option
> I understood that the dictionary would be stored in clear text beside the catalogue, which would stay encrypted.
> > It should look like this: { H("/home/tobias/Documents/test.txt" + inodeID + mtime + UUID + salt) : [userID, groupID, perm, file_size, is_dir, type, flags, ctime] , H("/home/tobias/Pictures/foo.jpg" + inodeID + mtime + UUID + salt) : [userID, groupID, perm, file_size, is_dir, type, flags, ctime] , ... } (H() is a cryptographic hash function like sha256)
> > respectively: { b2144d23ebc9a7f2af44e215b00dce5025bdc227346c6459b989ef8d203f3402 : [userID, groupID, perm, file_size, is_dir, type, flags, ctime] , 0df9ba289c76d5bb1761a2764593bfe97d64f4c944ecfa08d6f7a16721b5f317 : [userID, groupID, perm, file_size, is_dir, type, flags, ctime] , }
> > In this scenario the only possibility for a collision to occur is inside the hash function, which is very unlikely to happen: http://stackoverflow.com/questions/4014090/is-it-safe-to-ignore-the-possibility-of-sha-collisions-in-practice/4014407#4014407
> > => In my opinion the possibility of a hash collision can be ignored.
> I admit the probability is very low, but this has to be documented, at least for the user to know the risk, as low as it may be.
> > > In fact, adding a system/hardware ID in the hash forbids the possibility to restore the whole data (most probably on a new filesystem, due to a crash for example) and then keep using the latest backup of reference as reference for the next incremental backup.
> > Yes, that's right. But this is not only because of the uuid; it's also because I want to use the inode number, which will be different after the restore as well.
> Yes, that's correct. I just wonder: why add the inodeID and UUID? Would salt alone not be sufficient to randomize the data to hash? By the way, I suspect there would be a different salt value per hash? Would the salt for each entry be stored in clear beside the corresponding hash? No offense, my cryptographic knowledge is quite basic! :)
> > In this case the user has to enter the encryption password to use the encrypted catalogue as reference, or a full backup will be created. I think this restriction is acceptable.
> It is for me too. As you say, there is the catalogue for that situation.
> > Of course I could use the same password for all backups of one system and prompt the user for it only once (this can be done without modifying dar, just by using libdar), but that's not the point.
> > I admit the dictionary is not that easy to implement and it will require changes to the archive format as well, but I think it can be quite handy for a lot of users who want to encrypt their backups.
> The archive format is flexible, so it is no problem to add new fields. The point concerns more the algorithm of differential backup (filtre.cpp: the filtre_sauvegarde() routine). It should be able to handle hashes in place of filenames while also performing file comparison on filenames (for normal differential backup).
> Another point to consider is the algorithmic complexity (I mean the time to execute the requested task). Currently, when doing a differential backup, each file from the filesystem under backup is first searched in the reference catalogue, but only in the directory it is located in. Here, due to the hash on the whole path+filename, each new file to consider for backup has to be hashed, and this hash has to be compared more widely against the whole archive hash base. Of course, having a sorted list of hashes (as there is for filenames in each directory) leads to a faster search (binary search), but it remains that the execution time will increase with the number of files in the archive. I guess this hash lookup is not the biggest CPU consuming task in libdar (compared with data compression or encryption), but it is nevertheless a scalability issue.
> I think I now get the picture of your request/idea. This is a reasonable compromise, while not a simple feature to implement... :-/
> I have added it to the Feature Request list on SourceForge: https://sourceforge.net/p/dar/feature-requests/173/
> I can't promise I will have time to implement it for release 2.5.0, the next major release, which I would like to finish developing this year for a release in the first half of 2015. I'm taking more time than expected testing the current feature (multi-threaded libdar), while the performance benefit is not very visible for now... well, I have not yet tuned it all; I first have to make it work as expected. So I can't promise, but I will try.
> > Regards, Tobias
> Regards,
> Denis. |
From: Denis C. <dar...@fr...> - 2014-10-07 19:26:47
|
On 07/10/2014 17:41, Tobias Specht wrote:
> Hi Denis,

Hi Tobias,

> maybe I was not exact enough about what I want to hash and how the dictionary is organized: * I don't want to hash the content of the file

This I understood,

> * the dictionary is not organized hierarchically, as your catalogue is

This I didn't, but OK; that does not change the picture much, and it makes sense to avoid exposing the directory tree structure.

> * when I'm talking about filename I mean path + file name

OK,

> * the dictionary does not replace the catalogue, it is just an extra option

I understood that the dictionary would be stored in clear text beside the catalogue, which would stay encrypted.

> It should look like this: { H("/home/tobias/Documents/test.txt" + inodeID + mtime + UUID + salt) : [userID, groupID, perm, file_size, is_dir, type, flags, ctime] , H("/home/tobias/Pictures/foo.jpg" + inodeID + mtime + UUID + salt) : [userID, groupID, perm, file_size, is_dir, type, flags, ctime] , ... } (H() is a cryptographic hash function like sha256)
> respectively: { b2144d23ebc9a7f2af44e215b00dce5025bdc227346c6459b989ef8d203f3402 : [userID, groupID, perm, file_size, is_dir, type, flags, ctime] , 0df9ba289c76d5bb1761a2764593bfe97d64f4c944ecfa08d6f7a16721b5f317 : [userID, groupID, perm, file_size, is_dir, type, flags, ctime] , }
> In this scenario the only possibility for a collision to occur is inside the hash function, which is very unlikely to happen: http://stackoverflow.com/questions/4014090/is-it-safe-to-ignore-the-possibility-of-sha-collisions-in-practice/4014407#4014407
> => In my opinion the possibility of a hash collision can be ignored.

I admit the probability is very low, but this has to be documented, at least for the user to know the risk, as low as it may be.

>> In fact, adding a system/hardware ID in the hash forbids the possibility to restore the whole data (most probably on a new filesystem, due to a crash for example) and then keep using the latest backup of reference as reference for the next incremental backup.
> Yes, that's right. But this is not only because of the uuid; it's also because I want to use the inode number, which will be different after the restore as well.

Yes, that's correct. I just wonder: why add the inodeID and UUID? Would salt alone not be sufficient to randomize the data to hash? By the way, I suspect there would be a different salt value per hash? Would the salt for each entry be stored in clear beside the corresponding hash? No offense, my cryptographic knowledge is quite basic! :)

> In this case the user has to enter the encryption password to use the encrypted catalogue as reference, or a full backup will be created. I think this restriction is acceptable.

It is for me too. As you say, there is the catalogue for that situation.

> Of course I could use the same password for all backups of one system and prompt the user for it only once (this can be done without modifying dar, just by using libdar), but that's not the point.
>
> I admit the dictionary is not that easy to implement and it will require changes to the archive format as well, but I think it can be quite handy for a lot of users who want to encrypt their backups.

The archive format is flexible, so it is no problem to add new fields. The point concerns more the algorithm of differential backup (filtre.cpp: the filtre_sauvegarde() routine). It should be able to handle hashes in place of filenames while also performing file comparison on filenames (for normal differential backup).

Another point to consider is the algorithmic complexity (I mean the time to execute the requested task). Currently, when doing a differential backup, each file from the filesystem under backup is first searched in the reference catalogue, but only in the directory it is located in. Here, due to the hash on the whole path+filename, each new file to consider for backup has to be hashed, and this hash has to be compared more widely against the whole archive hash base. Of course, having a sorted list of hashes (as there is for filenames in each directory) leads to a faster search (binary search), but it remains that the execution time will increase with the number of files in the archive. I guess this hash lookup is not the biggest CPU consuming task in libdar (compared with data compression or encryption), but it is nevertheless a scalability issue.

I think I now get the picture of your request/idea. This is a reasonable compromise, while not a simple feature to implement... :-/

I have added it to the Feature Request list on SourceForge:
https://sourceforge.net/p/dar/feature-requests/173/

I can't promise I will have time to implement it for release 2.5.0, the next major release, which I would like to finish developing this year for a release in the first half of 2015. I'm taking more time than expected testing the current feature (multi-threaded libdar), while the performance benefit is not very visible for now... well, I have not yet tuned it all; I first have to make it work as expected. So I can't promise, but I will try.

> Regards, Tobias

Regards,
Denis. |
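The lookup cost Denis describes can be sketched like this. The names are hypothetical; a sorted vector plus std::lower_bound stands in for whatever structure libdar would actually use.

#include <algorithm>
#include <string>
#include <vector>

// Global hash base: one sorted vector for the whole archive instead of
// one small list per directory. Lookup is O(log N) in the total number
// of entries, so the cost grows with archive size rather than with the
// size of a single directory.
bool hash_present(const std::vector<std::string> & sorted_hashes,
                  const std::string & h)
{
    auto it = std::lower_bound(sorted_hashes.begin(), sorted_hashes.end(), h);
    return it != sorted_hashes.end() && *it == h;
}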
From: Tobias S. <spe...@gm...> - 2014-10-07 15:41:23
|
Hi Denis,

maybe I was not exact enough about what I want to hash and how the dictionary is organized:
* I don't want to hash the content of the file
* the dictionary is not organized hierarchically, as your catalogue is
* when I'm talking about filename I mean path + file name
* the dictionary does not replace the catalogue, it is just an extra option

It should look like this:
{ H("/home/tobias/Documents/test.txt" + inodeID + mtime + UUID + salt) : [userID, groupID, perm, file_size, is_dir, type, flags, ctime] ,
  H("/home/tobias/Pictures/foo.jpg" + inodeID + mtime + UUID + salt) : [userID, groupID, perm, file_size, is_dir, type, flags, ctime] ,
  ... }
(H() is a cryptographic hash function like sha256)

respectively:
{ b2144d23ebc9a7f2af44e215b00dce5025bdc227346c6459b989ef8d203f3402 : [userID, groupID, perm, file_size, is_dir, type, flags, ctime] ,
  0df9ba289c76d5bb1761a2764593bfe97d64f4c944ecfa08d6f7a16721b5f317 : [userID, groupID, perm, file_size, is_dir, type, flags, ctime] , }

In this scenario the only possibility for a collision to occur is inside the hash function, which is very unlikely to happen:
http://stackoverflow.com/questions/4014090/is-it-safe-to-ignore-the-possibility-of-sha-collisions-in-practice/4014407#4014407
=> In my opinion the possibility of a hash collision can be ignored.

> In fact, adding a system/hardware ID in the hash forbids the possibility to restore the whole data (most probably on a new filesystem, due to a crash for example) and then keep using the latest backup of reference as reference for the next incremental backup.

Yes, that's right. But this is not only because of the uuid; it's also because I want to use the inode number, which will be different after the restore as well. In this case the user has to enter the encryption password to use the encrypted catalogue as reference, or a full backup will be created. I think this restriction is acceptable.

Of course I could use the same password for all backups of one system and prompt the user for it only once (this can be done without modifying dar, just by using libdar), but that's not the point.

I admit the dictionary is not that easy to implement and it will require changes to the archive format as well, but I think it can be quite handy for a lot of users who want to encrypt their backups.

Regards,
Tobias

On Sunday, 05.10.2014 at 12:38 +0200, Denis Corbin wrote:
> On 01/10/2014 18:04, Tobias Specht wrote:
> > Hi Denis,
> Hi Tobias,
> > I like dar with its philosophy and I want to create a program using libdar to implement some kind of intelligence. Of course there will be some options, but it should be enough to just define a "Backup Drive" and backups will be created automatically every day the user powers on the computer. And encryption should be at least a strongly recommended option.
> > This leads me to the catalogue problem. I agree with you that dar does not need the kind of intelligence I'm planning for my backup tool, but as the catalogue and the process of creating a backup against a reference are elementary features of dar, I think this problem could be better solved within dar.
> > In the catalogue there is stored: * inodeID * filename (with its path) * file permissions * userID * groupID * file size * last modification date (mtime) * last change date (ctime) * if the file is a directory (is_dir) * if the file has children or is an empty dir * file type * flags about saved data / saved EA / compression used (correct me if I'm wrong)
> More or less, yes, but that's a matter of details.
> > I think the most private information that has to be protected is the filename.
> I agree with that.
> > My idea was to create a second "hashed" catalogue which contains only the information necessary to create a backup against a reference. It is structured like a dictionary, with a hash representing the filename on one side and some information about the file on the other side. This dictionary can be stored outside the encrypted area of the archive because it doesn't contain any private information. Yes, it is more data to be stored, and in general it contains only redundant information, but with it we can create a differential backup from an encrypted archive without entering the encryption password, which will lead to more usability. (And do you know any backup tool providing such a feature?)
> > The first idea was just to hash the filename: { H(filename) : [inodeID, userID, groupID, perm, file_size, ctime, mtime, is_dir, type, flags] , ... } This is quite simple to implement, but it's not very resistant to brute-force attacks.
> For this first idea, there is already a point to consider, aside from the brute-force attack. If two different files in the same directory with different filenames produce the same hash, there is a conflict. While this should not occur very often, it is not impossible.
> The same problem arises during the differential backup process: dar checks whether each file found on the filesystem already exists in the reference catalogue. Here, in order to compare, dar has to compute a hash for each filename read from the filesystem and compare it with the list of hashes available in the reference catalogue. But a false match may occur if a new file has the same hash as an old one. Most of the time dar will save that new file as expected, if mtime or another attribute changed compared with the wrong reference, but in some rare cases it may fail to save that new file, wrongly assuming it has not changed because it was compared with a wrong reference.
> So, even if the chances are small that this situation occurs, it is not impossible. How to cope with that? Do we have to inform the user that there is a risk that the backup is not perfect, but not to worry, as it occurs only in very rare situations? Would you find that acceptable as a user? :)
> > So I thought about slipping into the calculation of the hash value some information that is available at the point of creating the reference backup and also when creating the new backup, but isn't known to an attacker who only has access to the archive: { H(filename + inodeID + mtime) : [userID, groupID, perm, file_size, is_dir, type, flags, ctime] , ... } I had a look at your source code (filtre.cpp/filtre_sauvegarde) and, as far as I understand, you first try to find the file based on its path in the reference backup. When there is a match you perform some optional security checks and afterwards you decide what information to store in the backup: * remove_ea * saving_inode * saving_ea * saving_fsa. If there is no match you have to store the whole file.
> > When using the hash value, the first part leads to a slightly different result, as the hash doesn't consist only of the filename but also of the inodeID and mtime. But this shouldn't be a problem, as the inodeID changes only when mtime changes too.
> Right, that's better: only comparing the hash will let dar know whether a file has to be saved again or not (aside from the hash conflict mentioned above).
> > And in this case the whole file would be saved anyway.
> Right.
> > As the security check is implemented now, it should also be fine with the hash, because it relies on having the same mtime in both archives. The evaluation of which action to perform when mtime hasn't changed should be applicable with the information stored in the dictionary.
> OK, this lets dar see if only EA/FSA have changed and resave that part only if necessary.
> > In addition we should add some sort of UUID which is connected to the system in such a way that it doesn't change during normal system operation. I thought about the partition UUID, but this is not always that simple
> In fact, adding a system/hardware ID in the hash forbids the possibility to restore the whole data (most probably on a new filesystem, due to a crash for example) and then keep using the latest backup of reference as reference for the next incremental backup.
> > when we think about LVM and btrfs, but maybe there is something else we can use. To break rainbow-table attacks we should also add a random salt per archive: { H(filename + inodeID + mtime + UUID + salt) : [userID, groupID, perm, file_size, is_dir, type, flags, ctime] , ... } Originally I also wanted to slip the file_size into the hash value, but this conflicts with the security check and the sparse file detection.
> > The dictionary could be saved for example in a Berkeley DB, which could be stored somewhere in the archive. As hash function I would suggest Keccak with 512 bits and 100 rounds.
> > What do you think about the idea of having a second (hashed) catalogue?
> That's an interesting approach, though not that simple to implement, and there is the point about hash collisions to address.
> I thought about another way to do encrypted differential backups that, from the user's point of view, has the same footprint as doing a full backup: using the same key (symmetric or asymmetric) for the archive of reference and the new differential backup, without having dar ask twice for the password as it does for a full backup.
> Given the encryption key, dar tries to open the encrypted isolated catalogue; if it succeeds, it assumes the user gave the key without typo errors and uses that key to encrypt the new differential backup.
> I guess that when you use a symmetric key, very few people use a different key for each new differential archive, right? I also guess that, when using asymmetric encryption, it is always the same public/private key pair that is used, thus the same passphrase is requested to open the private key (enciphering and signature).
> Would this address your need? This is much easier to implement, from my point of view. |
From: Denis C. <dar...@fr...> - 2014-10-05 10:38:17
|
On 01/10/2014 18:04, Tobias Specht wrote:
> Hi Denis,

Hi Tobias,

> I like dar with its philosophy and I want to create a program using libdar to implement some kind of intelligence. Of course there will be some options, but it should be enough to just define a "Backup Drive" and backups will be created automatically every day the user powers on the computer. And encryption should be at least a strongly recommended option.
>
> This leads me to the catalogue problem. I agree with you that dar does not need the kind of intelligence I'm planning for my backup tool, but as the catalogue and the process of creating a backup against a reference are elementary features of dar, I think this problem could be better solved within dar.
>
> In the catalogue there is stored: * inodeID * filename (with its path) * file permissions * userID * groupID * file size * last modification date (mtime) * last change date (ctime) * if the file is a directory (is_dir) * if the file has children or is an empty dir * file type * flags about saved data / saved EA / compression used (correct me if I'm wrong)

More or less, yes, but that's a matter of details.

> I think the most private information that has to be protected is the filename.

I agree with that.

> My idea was to create a second "hashed" catalogue which contains only the information necessary to create a backup against a reference. It is structured like a dictionary, with a hash representing the filename on one side and some information about the file on the other side. This dictionary can be stored outside the encrypted area of the archive because it doesn't contain any private information. Yes, it is more data to be stored, and in general it contains only redundant information, but with it we can create a differential backup from an encrypted archive without entering the encryption password, which will lead to more usability. (And do you know any backup tool providing such a feature?)
>
> The first idea was just to hash the filename: { H(filename) : [inodeID, userID, groupID, perm, file_size, ctime, mtime, is_dir, type, flags] , ... } This is quite simple to implement, but it's not very resistant to brute-force attacks.

For this first idea, there is already a point to consider, aside from the brute-force attack. If two different files in the same directory with different filenames produce the same hash, there is a conflict. While this should not occur very often, it is not impossible.

The same problem arises during the differential backup process: dar checks whether each file found on the filesystem already exists in the reference catalogue. Here, in order to compare, dar has to compute a hash for each filename read from the filesystem and compare it with the list of hashes available in the reference catalogue. But a false match may occur if a new file has the same hash as an old one. Most of the time dar will save that new file as expected, if mtime or another attribute changed compared with the wrong reference, but in some rare cases it may fail to save that new file, wrongly assuming it has not changed because it was compared with a wrong reference.

So, even if the chances are small that this situation occurs, it is not impossible. How to cope with that? Do we have to inform the user that there is a risk that the backup is not perfect, but not to worry, as it occurs only in very rare situations? Would you find that acceptable as a user? :)

> So I thought about slipping into the calculation of the hash value some information that is available at the point of creating the reference backup and also when creating the new backup, but isn't known to an attacker who only has access to the archive: { H(filename + inodeID + mtime) : [userID, groupID, perm, file_size, is_dir, type, flags, ctime] , ... } I had a look at your source code (filtre.cpp/filtre_sauvegarde) and, as far as I understand, you first try to find the file based on its path in the reference backup. When there is a match you perform some optional security checks and afterwards you decide what information to store in the backup: * remove_ea * saving_inode * saving_ea * saving_fsa. If there is no match you have to store the whole file.
>
> When using the hash value, the first part leads to a slightly different result, as the hash doesn't consist only of the filename but also of the inodeID and mtime. But this shouldn't be a problem, as the inodeID changes only when mtime changes too.

Right, that's better: only comparing the hash will let dar know whether a file has to be saved again or not (aside from the hash conflict mentioned above).

> And in this case the whole file would be saved anyway.

Right.

> As the security check is implemented now, it should also be fine with the hash, because it relies on having the same mtime in both archives. The evaluation of which action to perform when mtime hasn't changed should be applicable with the information stored in the dictionary.

OK, this lets dar see if only EA/FSA have changed and resave that part only if necessary.

> In addition we should add some sort of UUID which is connected to the system in such a way that it doesn't change during normal system operation. I thought about the partition UUID, but this is not always that simple

In fact, adding a system/hardware ID in the hash forbids the possibility to restore the whole data (most probably on a new filesystem, due to a crash for example) and then keep using the latest backup of reference as reference for the next incremental backup.

> when we think about LVM and btrfs, but maybe there is something else we can use. To break rainbow-table attacks we should also add a random salt per archive: { H(filename + inodeID + mtime + UUID + salt) : [userID, groupID, perm, file_size, is_dir, type, flags, ctime] , ... } Originally I also wanted to slip the file_size into the hash value, but this conflicts with the security check and the sparse file detection.
>
> The dictionary could be saved for example in a Berkeley DB, which could be stored somewhere in the archive. As hash function I would suggest Keccak with 512 bits and 100 rounds.
>
> What do you think about the idea of having a second (hashed) catalogue?

That's an interesting approach, though not that simple to implement, and there is the point about hash collisions to address.

I thought about another way to do encrypted differential backups that, from the user's point of view, has the same footprint as doing a full backup: using the same key (symmetric or asymmetric) for the archive of reference and the new differential backup, without having dar ask twice for the password as it does for a full backup.

Given the encryption key, dar tries to open the encrypted isolated catalogue; if it succeeds, it assumes the user gave the key without typo errors and uses that key to encrypt the new differential backup.

I guess that when you use a symmetric key, very few people use a different key for each new differential archive, right? I also guess that, when using asymmetric encryption, it is always the same public/private key pair that is used, thus the same passphrase is requested to open the private key (enciphering and signature).

Would this address your need? This is much easier to implement, from my point of view.

> Regards, Tobias

Regards,
Denis. |
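On today's command line, the situation Denis describes looks something like the following; the archive names and passphrase are examples only (-A designates the archive of reference, -K the key for the new archive, -J the key used to read the encrypted reference). His proposal would let the key be given once rather than twice.

dar -c monday_diff -A sunday_full -K aes:mysecret -J aes:mysecret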
From: Tobias S. <spe...@gm...> - 2014-10-01 16:04:45
|
Hi Denis,

I like dar with its philosophy and I want to create a program using libdar to implement some kind of intelligence. Of course there will be some options, but it should be enough to just define a "Backup Drive" and backups will be created automatically every day the user powers on the computer. And encryption should be at least a strongly recommended option.

This leads me to the catalogue problem. I agree with you that dar does not need the kind of intelligence I'm planning for my backup tool, but as the catalogue and the process of creating a backup against a reference are elementary features of dar, I think this problem could be better solved within dar.

In the catalogue there is stored:
* inodeID
* filename (with its path)
* file permissions
* userID
* groupID
* file size
* last modification date (mtime)
* last change date (ctime)
* if the file is a directory (is_dir)
* if the file has children or is an empty dir
* file type
* flags about saved data / saved EA / compression used
(correct me if I'm wrong)

I think the most private information that has to be protected is the filename. My idea was to create a second "hashed" catalogue which contains only the information necessary to create a backup against a reference. It is structured like a dictionary, with a hash representing the filename on one side and some information about the file on the other side. This dictionary can be stored outside the encrypted area of the archive because it doesn't contain any private information. Yes, it is more data to be stored, and in general it contains only redundant information, but with it we can create a differential backup from an encrypted archive without entering the encryption password, which will lead to more usability. (And do you know any backup tool providing such a feature?)

The first idea was just to hash the filename:
{ H(filename) : [inodeID, userID, groupID, perm, file_size, ctime, mtime, is_dir, type, flags] , ... }
This is quite simple to implement, but it's not very resistant to brute-force attacks. So I thought about slipping into the calculation of the hash value some information that is available at the point of creating the reference backup and also when creating the new backup, but isn't known to an attacker who only has access to the archive:
{ H(filename + inodeID + mtime) : [userID, groupID, perm, file_size, is_dir, type, flags, ctime] , ... }
I had a look at your source code (filtre.cpp/filtre_sauvegarde) and, as far as I understand, you first try to find the file based on its path in the reference backup. When there is a match you perform some optional security checks and afterwards you decide what information to store in the backup: * remove_ea * saving_inode * saving_ea * saving_fsa. If there is no match you have to store the whole file.

When using the hash value, the first part leads to a slightly different result, as the hash doesn't consist only of the filename but also of the inodeID and mtime. But this shouldn't be a problem, as the inodeID changes only when mtime changes too. And in this case the whole file would be saved anyway. As the security check is implemented now, it should also be fine with the hash, because it relies on having the same mtime in both archives. The evaluation of which action to perform when mtime hasn't changed should be applicable with the information stored in the dictionary.

In addition we should add some sort of UUID which is connected to the system in such a way that it doesn't change during normal system operation. I thought about the partition UUID, but this is not always that simple when we think about LVM and btrfs; maybe there is something else we can use. To break rainbow-table attacks we should also add a random salt per archive:
{ H(filename + inodeID + mtime + UUID + salt) : [userID, groupID, perm, file_size, is_dir, type, flags, ctime] , ... }
Originally I also wanted to slip the file_size into the hash value, but this conflicts with the security check and the sparse file detection.

The dictionary could be saved for example in a Berkeley DB, which could be stored somewhere in the archive. As hash function I would suggest Keccak with 512 bits and 100 rounds.

What do you think about the idea of having a second (hashed) catalogue?

Regards,
Tobias

On Sunday, 28.09.2014 at 10:36 +0200, Denis Corbin wrote:
> On 27/09/2014 18:13, Tobias Specht wrote:
> > Hi Denis,
> Hi Tobias,
> > now I see the problem with signing the symmetric key, and your solution of also signing the catalogue sounds reasonable.
> > Are you interested in discussing the catalogue topic a little bit more?
> Of course!
> > I don't feel comfortable with storing passwords in clear text even on my own system, and storing the catalogue unencrypted is also not very consistent in terms of privacy. In my opinion both options actually deter users from encrypting their backups.
> I guess the main reason is not so much the complexity of encryption as ignorance and blindness to possible marketing/profiling abuses. When you don't even ask yourself whether you can trust the owners of the remote "cloud" storage you send your data to not to read or analyze the content you've sent, you don't even think about encrypting your data before sending it out...
> > To make encryption more popular there should be no disadvantages when using it!
> I agree with that point; the less effort a task requires, the more people will probably do it. However, sometimes doing something safely will always cost more than doing it another way. In that situation, educating users comes to the rescue. For example, many people today use a seat belt when travelling by car, while it is just easier not to use it. :)
> > (I mean the problem with doing backups against a reference when using encryption at the same time.) The effort for the user should be as small as possible.
> Right; for now, as small as possible means issuing a password. Without any key (of any sort) being provided, how do you see a mechanism that could differentiate a user that has the right to access the data from one that does not? How could it be done better/simpler? I guess you have suggestions for the differential backup context? :)
> > As you know, it isn't even simple to convince a user to make backups at all.
> ... education... unless automatic backup is performed by the system. But if users are ignorant of the existence of such an automatic backup mechanism, how would they think to rely on it to restore their system when a single file gets lost by mistake, or a whole system has been destroyed (crash, theft, disaster, ...)? The system has to be even more "smart" (as opposed to the user). Usually having systems get "smarter" removes freedom from the users... so we must also pay attention not to remove freedom from clever users, those that are educated about a subject (here, backup) and/or don't completely (want to) rely on a "smart" system to provide the service they need.
> > But when it means more effort to use the encryption option, it is very unlikely that he will use it. What do you think?
> Nothing more than what I have answered above.
> The new public key encryption is a partial solution to that point, together with the additional feature of the encryption algorithm being recorded in the archive header/trailer:
> * Encrypting an archive is as simple as listing the email recipients we want to encrypt the archive for, with their corresponding public keys.
> * Deciphering an archive is as simple as a clear archive (as soon as you have an adequate private key); no -K option to give...
> But, yes, this does not answer the differential backup case you underline, nor does it address backup/restoration to recover from a disaster, for example: you need the private key to decipher the archive...
> Another point to mention and take into consideration in this discussion: dar/libdar is quite a low-level tool, not targeted at people that need to rely on a smart system... However, it can serve other tools that provide this high-level intelligence about what users need, without the user expressing any request...
> The "philosophy" of dar/libdar is a tool with logical default values and systematically explicit options; no guessing, no "intelligence", in order to preserve the user's freedom to use or activate the features they want.
> By contrast, maybe you have been using MS Word. How annoying it is to have it capitalize a word automatically because it "thinks" it has to be capitalized... But if it were not the case, it would give you additional work to correct what has been changed without you having been asked, and it leads you to become more vigilant so that it does not turn what follows an 'E' into an exponent, and so on... My point of view with such "intelligent" tools is that I am not that stupid: I know how to type an uppercase or a lowercase... let me own my mistakes and keep my freedom to write the way I want.
> In short, too much or badly designed "intelligence" in software may become more painful than helpful.
> As you see, I just want to avoid that with dar/libdar; tools relying on libdar are not my concern, every need has to be satisfied, but at different levels.
> In that context, yes, I am open to considering anything that could simplify the use of encryption within dar/libdar, or any mechanism that could help applications overlying dar/libdar to provide that smart service to users. :)
> > I would be glad to hear other comments on this topic.
> > Regards, Tobias
> Best Regards,
> Denis. |
From: Denis C. <dar...@fr...> - 2014-09-28 08:36:25
|
On 27/09/2014 18:13, Tobias Specht wrote:
> Hi Denis,

Hi Tobias,

> now I see the problem with signing the symmetric key, and your solution of also signing the catalogue sounds reasonable.
>
> Are you interested in discussing the catalogue topic a little bit more?

Of course!

> I don't feel comfortable with storing passwords in clear text even on my own system, and storing the catalogue unencrypted is also not very consistent in terms of privacy. In my opinion both options actually deter users from encrypting their backups.

I guess the main reason is not so much the complexity of encryption as ignorance and blindness to possible marketing/profiling abuses. When you don't even ask yourself whether you can trust the owners of the remote "cloud" storage you send your data to not to read or analyze the content you've sent, you don't even think about encrypting your data before sending it out...

> To make encryption more popular there should be no disadvantages when using it!

I agree with that point; the less effort a task requires, the more people will probably do it. However, sometimes doing something safely will always cost more than doing it another way. In that situation, educating users comes to the rescue. For example, many people today use a seat belt when travelling by car, while it is just easier not to use it. :)

> (I mean the problem with doing backups against a reference when using encryption at the same time.) The effort for the user should be as small as possible.

Right; for now, as small as possible means issuing a password. Without any key (of any sort) being provided, how do you see a mechanism that could differentiate a user that has the right to access the data from one that does not? How could it be done better/simpler? I guess you have suggestions for the differential backup context? :)

> As you know, it isn't even simple to convince a user to make backups at all.

... education... unless automatic backup is performed by the system. But if users are ignorant of the existence of such an automatic backup mechanism, how would they think to rely on it to restore their system when a single file gets lost by mistake, or a whole system has been destroyed (crash, theft, disaster, ...)? The system has to be even more "smart" (as opposed to the user). Usually having systems get "smarter" removes freedom from the users... so we must also pay attention not to remove freedom from clever users, those that are educated about a subject (here, backup) and/or don't completely (want to) rely on a "smart" system to provide the service they need.

> But when it means more effort to use the encryption option, it is very unlikely that he will use it. What do you think?

Nothing more than what I have answered above.

The new public key encryption is a partial solution to that point, together with the additional feature of the encryption algorithm being recorded in the archive header/trailer:
* Encrypting an archive is as simple as listing the email recipients we want to encrypt the archive for, with their corresponding public keys.
* Deciphering an archive is as simple as a clear archive (as soon as you have an adequate private key); no -K option to give...

But, yes, this does not answer the differential backup case you underline, nor does it address backup/restoration to recover from a disaster, for example: you need the private key to decipher the archive...

Another point to mention and take into consideration in this discussion: dar/libdar is quite a low-level tool, not targeted at people that need to rely on a smart system... However, it can serve other tools that provide this high-level intelligence about what users need, without the user expressing any request...

The "philosophy" of dar/libdar is a tool with logical default values and systematically explicit options; no guessing, no "intelligence", in order to preserve the user's freedom to use or activate the features they want.

By contrast, maybe you have been using MS Word. How annoying it is to have it capitalize a word automatically because it "thinks" it has to be capitalized... But if it were not the case, it would give you additional work to correct what has been changed without you having been asked, and it leads you to become more vigilant so that it does not turn what follows an 'E' into an exponent, and so on... My point of view with such "intelligent" tools is that I am not that stupid: I know how to type an uppercase or a lowercase... let me own my mistakes and keep my freedom to write the way I want.

In short, too much or badly designed "intelligence" in software may become more painful than helpful.

As you see, I just want to avoid that with dar/libdar; tools relying on libdar are not my concern, every need has to be satisfied, but at different levels.

In that context, yes, I am open to considering anything that could simplify the use of encryption within dar/libdar, or any mechanism that could help applications overlying dar/libdar to provide that smart service to users. :)

> I would be glad to hear other comments on this topic.
>
> Regards, Tobias

Best Regards,
Denis. |
From: Tobias S. <spe...@gm...> - 2014-09-27 16:13:22
Hi Denis,

now I see the problem with signing the symmetric key, and your solution to also sign the catalogue sounds reasonable. Are you interested in discussing the catalogue topic a little bit more?

I don't feel comfortable with storing passwords in clear text even on my own system, and storing the catalogue unencrypted is also not very consistent in terms of privacy. In my opinion both options actually prevent users from encrypting their backups. To make encryption more popular there should be no disadvantages when using it! (I mean the problem with doing referential backups when using encryption at the same time.) The effort for the user should be as small as possible. As you know, it isn't even simple to convince a user to make backups at all. But when it means more effort to use the encryption option, it is very unlikely that he will use it. What do you think?

I would be glad to hear other comments on this topic.

Regards, Tobias
From: Denis C. <dar...@fr...> - 2014-09-25 19:14:11
On 24/09/2014 17:08, Tobias Specht wrote:
> Hi Denis,

Hi Tobias,

> sorry that I hadn't checked your current master branch, you have done great work!

No problem, the dev branch is not very visible...

> Regarding your problem with the signature. Maybe we are talking past each other, but in general signing goes like this: a check-sum of a document is encrypted with the private key of the sender (not with the public key of the sender nor of the receiver), so that everyone else can verify the signature by calculating the check-sum on their own and comparing it with the value from the signature, as everyone can decrypt it with the public key of the sender.

yep,

> In your case the document is the symmetric key, and as check-sum you could calculate a hash value. When you encrypt the hash value with the private key of the sender (e.g. the person who creates the backup), it is in my opinion ensured that no one else has created or modified this backup.

In other words, the "document", which is the symmetric key, could be copied as is into a new archive. Any recipient can also decrypt the "document" and obtain the symmetric key, and can thus use it to create a new encrypted archive with that same key. This would lead any other recipient to see this faked archive as signed by the same person as the original one, unless you apply the signature to some other part of the archive, like the internal catalogue.

> The thing with the asymmetric encryption was just an idea I had a few days ago. And I think we have designed it for two different use cases. The general idea of using asymmetric encryption was in my case to create backups automatically without prompting the user to enter the encryption password, not to encrypt them for different recipients.

Well, you could also drop the password in a file that only the user can read, and feed that file to dar using the -B option. No need for asymmetrical encryption to get an encrypted archive without manual interaction or exposing the password on the command line.

> And this was also the background of the catalogue question. Of course I can create a differential backup based on an encrypted archive by providing the key for the archive of reference. But in this case user interaction is needed, which I wanted to avoid. You understand?

I better understand what you want to do, but why not use a Dar Command-line File (DCF file) with the -B option, as just described? I do it myself for my own backups that get stored remotely. Moreover, as it is a password that I know, in case of disaster I do not need any private key to decrypt the archives.

> Yes, you're right, an unencrypted catalogue does not ensure privacy. And for sure there is no easy solution for this problem. In my opinion the most sensitive data in the catalogue are the names of files and folders. One quick approach to restore privacy in an unencrypted catalogue would be to store only hash values of this information. However you solve this problem, it would lead to big changes in dar, and maybe the field of application would be quite small.

That's right,

> So never mind.
>
> About the private key, I didn't want to store it inside the archive. I just wanted to store the information about which private key to use for decryption. In your case this is done via the email address of the recipients, and in my case it would be the fingerprint of the public key with which the symmetric key has been encrypted.

OK, I see, sorry for my misunderstanding!

> One last thing I want to add regarding private keys in general. Because the whole security relies on this file, it has to be kept secret, and this is not done by storing it unencrypted on a hard drive (I'm sure you don't write your passwords into a text file either).

Well, yes I do :) but with adequate file permissions, and if I trust the system engineer (which I am, at home). For work backups, of course, I type the password at each new backup...

> When you don't want to have a password but want to ensure privacy, you can move the "security factor" into some kind of hardware token. But also in this case it is recommended to protect access to the private key with a PIN.

Yes, that's right.

> Regards, Tobias

Cheers,
Denis.
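[A minimal sketch of the DCF approach Denis mentions, using the -B and -K options named above; the file name, cipher and password are hypothetical placeholders:

    # ~/.dar/backup.dcf -- make it readable only by the owner (chmod 600);
    # it holds the symmetric encryption key so dar never prompts for it
    -K aes:MySecretBackupPassword

    # unattended encrypted backup: no password on the command line,
    # nothing to type interactively
    dar -c daily_backup -R /home/user -B ~/.dar/backup.dcf
]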
From: Tobias S. <spe...@gm...> - 2014-09-24 15:08:18
Hi Denis,

sorry that I hadn't checked your current master branch, you have done great work!

Regarding your problem with the signature. Maybe we are talking past each other, but in general signing goes like this: a check-sum of a document is encrypted with the private key of the sender (not with the public key of the sender nor of the receiver), so that everyone else can verify the signature by calculating the check-sum on their own and comparing it with the value from the signature, as everyone can decrypt it with the public key of the sender. In your case the document is the symmetric key, and as check-sum you could calculate a hash value. When you encrypt the hash value with the private key of the sender (e.g. the person who creates the backup), it is in my opinion ensured that no one else has created or modified this backup.

The thing with the asymmetric encryption was just an idea I had a few days ago. And I think we have designed it for two different use cases. The general idea of using asymmetric encryption was in my case to create backups automatically without prompting the user to enter the encryption password, not to encrypt them for different recipients. And this was also the background of the catalogue question. Of course I can create a differential backup based on an encrypted archive by providing the key for the archive of reference. But in this case user interaction is needed, which I wanted to avoid. You understand?

Yes, you're right, an unencrypted catalogue does not ensure privacy. And for sure there is no easy solution for this problem. In my opinion the most sensitive data in the catalogue are the names of files and folders. One quick approach to restore privacy in an unencrypted catalogue would be to store only hash values of this information. However you solve this problem, it would lead to big changes in dar, and maybe the field of application would be quite small. So never mind.

About the private key, I didn't want to store it inside the archive. I just wanted to store the information about which private key to use for decryption. In your case this is done via the email address of the recipients, and in my case it would be the fingerprint of the public key with which the symmetric key has been encrypted.

One last thing I want to add regarding private keys in general. Because the whole security relies on this file, it has to be kept secret, and this is not done by storing it unencrypted on a hard drive (I'm sure you don't write your passwords into a text file either). When you don't want to have a password but want to ensure privacy, you can move the "security factor" into some kind of hardware token. But also in this case it is recommended to protect access to the private key with a PIN.

Regards, Tobias
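[To make the generic sign/verify flow Tobias describes concrete, here is a sketch with plain gpg applied to an archive slice. File names are hypothetical, and this shows the textbook mechanism, not how dar implements signing internally:

    # signer: hash the file and sign the hash with the sender's private key
    gpg --detach-sign --output full_backup.1.dar.sig full_backup.1.dar

    # any recipient: verify using the sender's public key from the keyring
    gpg --verify full_backup.1.dar.sig full_backup.1.dar
]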
From: Denis C. <dar...@fr...> - 2014-09-24 12:32:58
Le 24/09/2014 02:00, Tobias Specht wrote:
> Hi Denis,

Hi Tobias,

> As I'm working on a user-friendly automated backup solution based on dar, I have also done some considerations about encryption. For me encryption is a major topic to ensure privacy, and for that it must be easy to use for everyone, not only for IT experts.

You are right... and libdar needs such user-friendly interfaces. :)

> The first question: is it possible to find out whether the archive is encrypted or not before I open it?

Yes, but not at API level. The archive header and trailer contain a flag that tells whether the archive has been encrypted. Starting with the future release 2.5.0, the encryption algorithm is also present in the header/trailer, so the user will only have to specify the password, even if the algorithm is not blowfish.

> When I try to open it without a password I get the error message anyway.

This is due to that flag in the archive header (used when sequential-read is used) and archive trailer (used with direct access (default) mode).

> Now to some deeper considerations about encryption. When creating backups automatically on a regular basis (e.g. every day the computer is running), it is annoying to enter the password every time to encrypt the backup. When thinking about this problem I came up with the following idea: we could use, in addition to the symmetric encryption system of the dar archive, an asymmetric encryption scheme like RSA. In asymmetric encryption a different key is used for decryption than for encryption.

This is implemented in the current development code (what will be released as 2.5.0)! Archive signing is also available! You can at the same time encrypt for several recipients and sign with your own key.

Note that the asymmetrical encryption is used only to cipher a randomly chosen key used for symmetrical encryption. It is always a symmetrical encryption algorithm that encrypts the whole archive. The archive signature is applied to that randomly chosen key. There is thus a weakness if the archive is signed and at the same time encrypted for *several* recipients: each recipient can know the random key used to encrypt the archive, and can thus reuse that key to create a completely different archive, faking the signature of the original sender.

To overcome that weakness in the signature (not the encryption), I have planned to add a hash of the archive catalogue (which contains a CRC of each file's data and EA), and sign this hash too, in addition to the randomly chosen key for symmetrical encryption. If you see other weak points or find a better idea, feel free to expose them here! :)

> Once for every user/computer a pair of public and private keys is generated. The public key can be stored in plain text because it is only used for encryption. But the private key must be encrypted, so that the user has to enter a password to open the key and to decrypt data that has been encrypted with his public key.

The asymmetrical encryption is implemented based on libgpgme, thus it uses the same keyring as gpg. In particular, public key verification, expiration and management are done there. For private keys, dar does not have to store them, just invoke their use by specifying the associated email address: this is the simplest option I found to target a particular public or private key, instead of specifying a full name or Key ID, which are either difficult to write down and associate to a peer, or may lead to ambiguity between different mailboxes of a given person (home/work, etc.).

> When a backup is created, a random password for the encryption of the archive can be generated. This password is only used for this one archive. And now comes the magic thing: the password can be encrypted with the public key, which can be done without user interaction.

Yes, this is the way it is implemented in the current development code.

> To decrypt the password for the backup, the private key is needed, which itself is encrypted.

To my point of view, there is a risk in transmitting the private key, even encrypted. It should not be necessary, thanks precisely to the public/private key separation. To encrypt, I only need the public key of my recipient(s). The archive can be sent without additional information to the expected recipients, who will be able to decipher the archive.

> In this case the user has to enter his password.

So we are back to passwords, not much different from symmetrical encryption, no?

> So a user only needs to remember the password of his private key. With this he can decrypt all the passwords for his backups.

Rather, use dar with symmetrical encryption on your ~/.gnupg configuration directory, which contains all the private and public keys you have! No? :)

Having the private key not transmitted and kept in secured storage (at the discretion of the user) lets the user choose whether or not to have a passphrase on his private key, without compromising security by requiring the private key to be exposed.

> In a backup solution the user should of course not care about all the keys, he only needs to enter his password for the private key and everything else is done by the program.

Yes, you underline the problem of disaster recovery when using public/private keys. An alternative to encrypting the private key alongside the archive would be for the advised user to keep a copy of his key at a secure location (a trusted friend) or to store it remotely (cloud) after having encrypted it (using dar, for example) with a strong but symmetrical algorithm using a password he would have to remember.

To my point of view, symmetrical encryption is suitable for backup to the cloud, where you can recover the whole data with the sole requirement of a password. Asymmetrical encryption instead seems more suited to exchanging data between different persons, either directly (email) or through repositories (cloud, ftp, ...).

> I have implemented a proof of concept to create backups like this and it is working really well. What do you think about this idea?
>
> If you like it, I have a feature request for you. To do all the RSA stuff I need to store some extra data: 1) the encrypted password of the dar archive

This is done in the current dev code,

> 2) the ID of the public key with which the password has been encrypted. (This is useful because the public key is also part of the private key, so it is easier to match the corresponding private key to the key-file of the backup.)

That's not necessary: if you have the private key in your keyring, the fact that the password is asymmetrically encrypted allows libgpgme to decrypt it automatically (check the current dev implementation, the man page is up to date on that point). [The current feature I'm working on is multi-threading inside libdar, which you cannot activate for now, so the current dev code is quite functional for the asymmetrical encryption, though not for production use; if you have any problem compiling the dar dev code and want to play with that asymmetrical feature, tell me.]

> This information can be saved in separate files, but this leads to confusion, and if any of these files gets lost, it is impossible to recover the password of the backup.

As you say; same for me, I don't like external files, and as said above this is not necessary.

> To make things easier it would be nice to store them in the header of the dar archive. To be exact, this would be 256 bytes for the encrypted password and a SHA256 value for the ID of the public key.

The random key has a variable length (+0 to +256 bytes) and at minimum 512 bytes (user-settable with the --key-length option); for the API, check the src/libdar/archive_option.hpp file, everything that is 'key' related.

> As I have just noticed, dar supports a user-defined message to be written inside the archive header (--user-comment).

That's correct,

> You have documented that this message is unencrypted

Yes, see the --user-comment option in the man page.

> even if the archive is encrypted. But I have found no way to read the message without providing the password of the archive.

Right, it is stored in clear, but the API call that opens an archive aborts if the archive contents (which is encrypted) is not readable (due to a wrong key or data corruption).

> Is it possible to read the user message without knowing the password of the archive?

Yes, just open the archive in a text editor! :) You will find it near the beginning of the archive (unless the -at option has been used).

> (In the long run it would be nice to have a separate option in the archive header for the RSA stuff.)

There is, check the --sign and --key options in the man page. Note that you can read the man page by getting the source code (branch master in GIT): 'cd man' then 'man ./dar.1' is the easiest way to get it without installing the whole software.

Same thing once compiled: you can run dar without installing it by 'cd src/dar_suite' and there running "./dar ..."

> Finally, there is another thing which I had to consider. When creating a backup with a reference to an encrypted archive, I need the password. To bypass this problem I have isolated the catalogue. I know it would increase the complexity of the archive header to store the catalogue either encrypted or unencrypted, but it would eliminate the need for the isolated catalogue file. I could imagine that is a quite common issue.

Maybe I don't follow you; if I'm wrong don't hesitate to tell me: you can do a differential/incremental backup with an encrypted archive as reference, using the -ref-key option to provide the necessary credentials to read that archive. The archive you create that way is either unencrypted, or encrypted if you use the -K option with the same or other credentials. (For a merging operation that involves two source archives there is also the -aux-key option.)

An isolated catalogue may be encrypted too (the -ref-key option to read the source archive and --key to encrypt the resulting archive).

But, yes, dar_manager still cannot read encrypted archives; to manage an encrypted archive with dar_manager you need an unciphered isolated catalogue... this feature is still on the todo list, but I guess I will not have time to implement it for release 2.5.0, which I have targeted for the end of the year (end of new feature implementation, then a testing/optimization phase, for what I hope will be a major release in 2015).

The src/build/Changelog file gives you an overview of the new features that will be available with release 2.5.0.

> Are you planning to add a possibility to store the catalogue unencrypted even if the archive data is encrypted?

No, this does not make sense to me. If the archive has to be encrypted, this is to prevent anyone except the authorized person(s) from reading its content, including its table of contents.

> Regards, Tobias

Kind Regards,
Denis.
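[A quick sketch of the differential-backup case Denis describes, using the options he names; option spelling (written --ref-key below) should be checked against the dar man page of that era, and archive names and passwords are placeholders:

    # full backup, symmetrically encrypted
    dar -c full_backup -R /home/user -K aes:OldPassword

    # differential backup taking the encrypted full backup as reference:
    # --ref-key decrypts the reference, -K encrypts the new archive
    dar -c diff_backup -R /home/user -A full_backup --ref-key aes:OldPassword -K aes:NewPassword
]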
From: Tobias S. <spe...@gm...> - 2014-09-24 00:00:35
Hi Denis,

As I'm working on a user-friendly automated backup solution based on dar, I have also done some considerations about encryption. For me encryption is a major topic to ensure privacy, and for that it must be easy to use for everyone, not only for IT experts.

The first question: is it possible to find out whether the archive is encrypted or not before I open it? When I try to open it without a password I get the error message anyway.

Now to some deeper considerations about encryption. When creating backups automatically on a regular basis (e.g. every day the computer is running), it is annoying to enter the password every time to encrypt the backup. When thinking about this problem I came up with the following idea: we could use, in addition to the symmetric encryption system of the dar archive, an asymmetric encryption scheme like RSA. In asymmetric encryption a different key is used for decryption than for encryption.

Once for every user/computer a pair of public and private keys is generated. The public key can be stored in plain text because it is only used for encryption. But the private key must be encrypted, so that the user has to enter a password to open the key and to decrypt data that has been encrypted with his public key.

When a backup is created, a random password for the encryption of the archive can be generated. This password is only used for this one archive. And now comes the magic thing: the password can be encrypted with the public key, which can be done without user interaction. To decrypt the password for the backup, the private key is needed, which itself is encrypted. In this case the user has to enter his password. So a user only needs to remember the password of his private key. With this he can decrypt all the passwords for his backups. In a backup solution the user should of course not care about all the keys; he only needs to enter his password for the private key, and everything else is done by the program.

I have implemented a proof of concept to create backups like this and it is working really well. What do you think about this idea?

If you like it, I have a feature request for you. To do all the RSA stuff I need to store some extra data: 1) the encrypted password of the dar archive, 2) the ID of the public key with which the password has been encrypted. (This is useful because the public key is also part of the private key, so it is easier to match the corresponding private key to the key-file of the backup.) This information can be saved in separate files, but this leads to confusion, and if any of these files gets lost, it is impossible to recover the password of the backup. To make things easier it would be nice to store them in the header of the dar archive. To be exact, this would be 256 bytes for the encrypted password and a SHA256 value for the ID of the public key.

As I have just noticed, dar supports a user-defined message to be written inside the archive header (--user-comment). You have documented that this message is unencrypted even if the archive is encrypted. But I have found no way to read the message without providing the password of the archive. Is it possible to read the user message without knowing the password of the archive? (In the long run it would be nice to have a separate option in the archive header for the RSA stuff.)

Finally, there is another thing which I had to consider. When creating a backup with a reference to an encrypted archive, I need the password. To bypass this problem I have isolated the catalogue. I know it would increase the complexity of the archive header to store the catalogue either encrypted or unencrypted, but it would eliminate the need for the isolated catalogue file. I could imagine that is a quite common issue. Are you planning to add a possibility to store the catalogue unencrypted even if the archive data is encrypted?

Regards, Tobias
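[The hybrid scheme Tobias describes can be sketched with stock tools; the recipient address, paths and key size are hypothetical, and this illustrates the idea rather than his actual proof of concept:

    # generate a random per-archive password, unattended
    PASS=$(head -c 32 /dev/urandom | base64)

    # encrypt the backup symmetrically with it
    dar -c daily_backup -R /home/user -K "aes:$PASS"

    # store the password encrypted with the user's public key, next to
    # the archive; no user interaction is needed at backup time
    echo "$PASS" | gpg --encrypt --recipient backup@example.org > daily_backup.pass.gpg

    # only at restore time is the private-key passphrase requested:
    # gpg --decrypt daily_backup.pass.gpg
]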
From: Tobias S. <spe...@gm...> - 2014-03-11 19:08:09
Hi Denis,

thank you very much. That's exactly what I was looking for.

Regards
Tobias

On Sunday, 09.03.2014, 21:18 +0100, Denis Corbin wrote:
> On 04/03/2014 21:20, Denis Corbin wrote:
> > On 28/02/2014 21:53, Tobias wrote:
> >> Hi Denis,
> [...]
> >
> > Yes, it will be possible with second precision starting with release 2.4.13, and with second and microsecond precision starting with 2.5.0.
>
> All this is now available from GIT, respectively on branch "branch_2.4.x" and on branch "master".
>
> >> Regards, Tobias
> >
> > Regards, Denis.