ccextractor-users Mailing List for ccextractor (Page 2)
Brought to you by:
cfsmp3
You can subscribe to this list here.
2007 |
Jan
|
Feb
|
Mar
|
Apr
|
May
(2) |
Jun
(2) |
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
---|---|---|---|---|---|---|---|---|---|---|---|---|
2008 |
Jan
|
Feb
|
Mar
|
Apr
|
May
(1) |
Jun
|
Jul
|
Aug
|
Sep
(1) |
Oct
|
Nov
|
Dec
|
2010 |
Jan
|
Feb
|
Mar
|
Apr
(1) |
May
|
Jun
|
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
2011 |
Jan
|
Feb
|
Mar
|
Apr
(1) |
May
|
Jun
|
Jul
|
Aug
|
Sep
|
Oct
(1) |
Nov
|
Dec
|
2012 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
|
Aug
(4) |
Sep
(1) |
Oct
|
Nov
|
Dec
|
2013 |
Jan
(1) |
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
|
Aug
|
Sep
|
Oct
(1) |
Nov
|
Dec
|
2014 |
Jan
(1) |
Feb
(4) |
Mar
(4) |
Apr
(13) |
May
(63) |
Jun
(27) |
Jul
(4) |
Aug
(3) |
Sep
(4) |
Oct
|
Nov
(2) |
Dec
|
2015 |
Jan
|
Feb
(1) |
Mar
(13) |
Apr
(1) |
May
(2) |
Jun
|
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
2016 |
Jan
(2) |
Feb
(1) |
Mar
(14) |
Apr
|
May
|
Jun
|
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
(2) |
2017 |
Jan
(1) |
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
From: Vishal G. <vis...@gm...> - 2015-03-12 17:50:31
|
Hello everyone, This is Vishal Gupta. I am studying Computer Science at International Institute of Information Technology, Hyderabad, India. I have worked with C, C++ and Python among others. My projects can be found here <https://github.com/gvishal>. I have experience in developing web applications in Python and PHP. I enjoy building API's and back-ends and have previously used Google App Engine(webapp2) and django to develop some applications. I(in a team) also developed a load testing framework <https://github.com/gvishal/Load-Testing-Tool> which provided a high level of concurrency using Python and Gevent. I am really interested in the "Worldwide repository" project and would like to know further about it. Also, where can I find the POC that is mentioned in the project description? I have also implemented as part of a project, a File Sharing program which required us to build a trivial ftp using TCP/UDP which supports file transfer and other operations. I am thus also interested in the project "Real time uploading". Regards, Vishal |
From: Carlos F. S. <ca...@cc...> - 2015-03-12 11:24:21
|
> > I had a doubt in the Linux and OSX GUI idea. Would you prefer a cross > platform GUI using languages like Python, C++ or OS dedicated ones. Because Portability is always a good thing. > Also is there any progress with the Garbled output issue? I have not yet > located the bug. There's some people looking into it but no solution yet as far as I know. > CCExtractor does detect it as MP4 file. Can you provide a sample for this > issue? I don't have one at this time but when I find one that triggers the problem I'll send it to you. > I want to try and learn adding more support for ccextractor. It looks > extremely exciting and I can learn something good from it. How do you > suggest I start? By figuring out what you would enjoy doing :-) |
From: Nurendra C. <nur...@gm...> - 2015-03-12 08:59:28
|
Hi, I had a doubt in the Linux and OSX GUI idea. Would you prefer a cross platform GUI using languages like Python, C++ or OS dedicated ones. Because the cross platform ones can be used in the future for other platforms too whereas the dedicated ones will have limited use. Also is there any progress with the Garbled output issue? I have not yet located the bug. Is this https://drive.google.com/file/d/0B_8j_IupNoCtOWdmVUtvTkwtSWM/view?usp=sharing the mp4 file mentioned in https://github.com/CCExtractor/ccextractor/pull/58/files#r14395363 CCExtractor does detect it as MP4 file. Can you provide a sample for this issue? <https://drive.google.com/file/d/0B_8j_IupNoCtOWdmVUtvTkwtSWM/view?usp=sharing> I want to try and learn adding more support for ccextractor. It looks extremely exciting and I can learn something good from it. How do you suggest I start? -Nurendra Choudhary. On Sun, Mar 8, 2015 at 2:34 AM, Carlos Fernandez Sanz < ca...@cc...> wrote: > On Sat, Mar 7, 2015 at 2:54 AM, Nurendra Choudhary < > nur...@gm...> wrote: > > About the garbled output. The codebase is pretty big so it may take me >> some time to figure it out in the code. But I have found a pattern in the >> output. >> > > We can't however solve the problem by trying to use the observed pattern > to move things to the correct place. For starters because you can't tell > when you need to do it or not. > >> >> Are you sure this is not an error in the CC of the video because I tried >> others and the srt came out well? >> > > In theory it's not a problem in the video because it plays OK in the > device that recorded it. > > >> Could you help me in finding out the module of this error? >> > > I think it's in the code that sorts the CC data (in sequencing.c) but this > is the kind of bug that takes more time to locate than fix. > > |
From: Hasala P. <per...@ya...> - 2015-03-12 08:28:28
|
Hello I am a postgraduate student reading for my MSc in Computer Science. I am interested in opting for the project on Automatic Translation and would like to know more information about it, please kind enough to reply me Thank You and RegardsHasala Perera |
From: Abhinav S. <abh...@gm...> - 2015-03-07 17:43:36
|
Hello everyone! I am Abhinav Shukla, a Computer Science student at the International Institute of Information Technology, Hyderabad. I have a fair amount of development experience in various fields like Game Development, Robot Management, Mapping and Routing etc and in a wide variety of programming languages. I really like the work that CCExtractor has done, and would love to be a part of the development team, both through and after GSoC. I'm still finding my feet with CCExtractor, however I have started trying to make my own tiny contributions to try and improve the code. which you can see here:- https://github.com/Abhinav95/ccextractor I would really appreciate any feedback which would steer me in the right direction. Looking forward to a delightful summer of code :) Cheers! Abhinav |
From: Carlos F. S. <ca...@cc...> - 2015-03-06 19:12:43
|
Your proposals was not accepted for two reasons 1) It just included 2 tasks, that we considered should take around 1 month, not 3. 2) We just had 3 slots and there were 3 proposals that we found a bit better. That's it. In the end it all comes to which proposals are best. Personally I had problems sleeping the days before knowing how many slots we'd get and whether I'd have to reject someone I really wanted to have on board. On the other hand, your proposal was very well written and it was clear you took it seriously. If there had been a 4th slot it would have been yours. By all means don't let last year's experience discourage you now :-) We are now a veteran organization so with luck we'll get an extra slot or two. On Fri, Mar 6, 2015 at 5:06 PM, Mahesh Bandara Wijerathna <m4...@gm... > wrote: > That's so funny. The translation thing was in my proposal for GSOC 2014 > > http://m4heshd.blogspot.com/2014/03/my-project-proposal-for-gsoc-2014.html > > So weird it's in the ideas list on this year in the exact same manner. > > On Fri, Mar 6, 2015 at 5:02 PM, Nurendra Choudhary < > nur...@gm...> wrote: > >> Hi, >> >> I am Nurendra Choudhary, a student at International Institute of >> Information Technology - Hyderabad. I am doing my major in Computational >> Linguistics and like coding in Python, C, C++. My interests lie in Natural >> Language Processing, Artificial Intelligence, Machine Learning and Machine >> Translation. >> >> I went through the ideas page of ccextractor and also the archives of the >> mailing list. I found the project "Automatic translation" very interesting >> and would like to do it in GSOC 2015 under ccextractor. >> >> I have previously worked with a lot of Google APIs like Maps, Web Speech, >> etc and also translation systems like Apertium. I have used the web >> interface of Translate many times but never really used the API. (I am >> thinking of doing that in the next few days.) Could you guide me a little >> on this? >> >> I started doing the GSOC tasks as advised on the ideas page. >> Here's the code for the first one: >> https://github.com/Akirato/AlgorithmCodes/tree/master/Miscellaneous/getAllCfunctions >> Is this program good enough? >> >> For the next one, I compiled the code using 'cmake' and got some warnings >> about unused variables. >> Are these the ones that are supposed to be worked on? >> The best way to obviously solve these warnings is to remove the unused >> variables from the code entirely. >> >> I am moving on to the next task for now. >> Am I on the right track or would you recommend doing something else? >> >> Regards, >> Nurendra Choudhary. >> >> >> >> ------------------------------------------------------------------------------ >> Dive into the World of Parallel Programming The Go Parallel Website, >> sponsored >> by Intel and developed in partnership with Slashdot Media, is your hub >> for all >> things parallel software development, from weekly thought leadership >> blogs to >> news, videos, case studies, tutorials and more. Take a look and join the >> conversation now. http://goparallel.sourceforge.net/ >> _______________________________________________ >> ccextractor-users mailing list >> cce...@li... >> https://lists.sourceforge.net/lists/listinfo/ccextractor-users >> >> > > > -- > > |
From: Mahesh B. W. <m4...@gm...> - 2015-03-06 16:07:27
|
That's so funny. The translation thing was in my proposal for GSOC 2014 http://m4heshd.blogspot.com/2014/03/my-project-proposal-for-gsoc-2014.html So weird it's in the ideas list on this year in the exact same manner. On Fri, Mar 6, 2015 at 5:02 PM, Nurendra Choudhary < nur...@gm...> wrote: > Hi, > > I am Nurendra Choudhary, a student at International Institute of > Information Technology - Hyderabad. I am doing my major in Computational > Linguistics and like coding in Python, C, C++. My interests lie in Natural > Language Processing, Artificial Intelligence, Machine Learning and Machine > Translation. > > I went through the ideas page of ccextractor and also the archives of the > mailing list. I found the project "Automatic translation" very interesting > and would like to do it in GSOC 2015 under ccextractor. > > I have previously worked with a lot of Google APIs like Maps, Web Speech, > etc and also translation systems like Apertium. I have used the web > interface of Translate many times but never really used the API. (I am > thinking of doing that in the next few days.) Could you guide me a little > on this? > > I started doing the GSOC tasks as advised on the ideas page. > Here's the code for the first one: > https://github.com/Akirato/AlgorithmCodes/tree/master/Miscellaneous/getAllCfunctions > Is this program good enough? > > For the next one, I compiled the code using 'cmake' and got some warnings > about unused variables. > Are these the ones that are supposed to be worked on? > The best way to obviously solve these warnings is to remove the unused > variables from the code entirely. > > I am moving on to the next task for now. > Am I on the right track or would you recommend doing something else? > > Regards, > Nurendra Choudhary. > > > > ------------------------------------------------------------------------------ > Dive into the World of Parallel Programming The Go Parallel Website, > sponsored > by Intel and developed in partnership with Slashdot Media, is your hub for > all > things parallel software development, from weekly thought leadership blogs > to > news, videos, case studies, tutorials and more. Take a look and join the > conversation now. http://goparallel.sourceforge.net/ > _______________________________________________ > ccextractor-users mailing list > cce...@li... > https://lists.sourceforge.net/lists/listinfo/ccextractor-users > > -- |
From: Nurendra C. <nur...@gm...> - 2015-03-06 11:32:21
|
Hi, I am Nurendra Choudhary, a student at International Institute of Information Technology - Hyderabad. I am doing my major in Computational Linguistics and like coding in Python, C, C++. My interests lie in Natural Language Processing, Artificial Intelligence, Machine Learning and Machine Translation. I went through the ideas page of ccextractor and also the archives of the mailing list. I found the project "Automatic translation" very interesting and would like to do it in GSOC 2015 under ccextractor. I have previously worked with a lot of Google APIs like Maps, Web Speech, etc and also translation systems like Apertium. I have used the web interface of Translate many times but never really used the API. (I am thinking of doing that in the next few days.) Could you guide me a little on this? I started doing the GSOC tasks as advised on the ideas page. Here's the code for the first one: https://github.com/Akirato/AlgorithmCodes/tree/master/Miscellaneous/getAllCfunctions Is this program good enough? For the next one, I compiled the code using 'cmake' and got some warnings about unused variables. Are these the ones that are supposed to be worked on? The best way to obviously solve these warnings is to remove the unused variables from the code entirely. I am moving on to the next task for now. Am I on the right track or would you recommend doing something else? Regards, Nurendra Choudhary. |
From: Hardik C. <har...@ii...> - 2015-03-04 15:52:37
|
Hi, this is Hardik again. I worked on the listed tasks and had a few questions. Apologies in advance if any of the queries seem trivial. * I compiled ccextractor on my ubuntu vm and got 2 warnings( 1) unused return value of 'write' and 2)call to __builtin___strncat_chk might overflow destination buffer). Strncat is safe to use as long as we know the length of the buffer and what is to go in the buffer, which is I think true in this code's case. But the first warning can be easily removed with an if condition check on the return value of 'write', it would also probably be more prudent to do so. Is there any specific reason for not doing this? * To try and get ccextractor to crash I thought it would be a good idea to embed captions on videos of different formats manually and pass these as input. (I played with the captioned video Anshul had provided me earlier, it helped) But there doesn't seem to be any open source software that currently does this. I later found that this is an idea in the list of project ideas for this years GSoC. Is there any open source software that currently does this, or is this an entirely new idea, or is this being done just to expand ccextractor's functionality? It would be convenient to have even the trial version of a paid software to generate manually captioned videos. Thank you! Hardik Choudhary BTech CSE 3rd year IIIT Delhi |
From: Anshul <ans...@gm...> - 2015-03-03 08:52:29
|
Hello I am not previliged enough to give you whole repository. but To play with ccextractor, go with this link <http://gsocdev.ccextractor.org/%7Eanshul/test_video/Closedcaption_rollup.ts>ed Video -Anshul On 03/03/2015 12:40 PM, Hardik Choudhary wrote: > Hello > This is Hardik Choudhary, a Computer Science undergrad from IIIT > Delhi, India. I have looked at the ideas page, and installed > CCextractor on my Windows 7. I would love to check out the video > sample repository to see where things work and where they go wrong. If > I get a feel for how this is working, I will start work on the written > tasks. Also can I know how many other people are working on these > tasks currently? > Thank you very much. > Hardik Choudhary > BTech CSE 3rd year > IIIT Delhi > > > ------------------------------------------------------------------------------ > Dive into the World of Parallel Programming The Go Parallel Website, sponsored > by Intel and developed in partnership with Slashdot Media, is your hub for all > things parallel software development, from weekly thought leadership blogs to > news, videos, case studies, tutorials and more. Take a look and join the > conversation now. http://goparallel.sourceforge.net/ > > > _______________________________________________ > ccextractor-users mailing list > cce...@li... > https://lists.sourceforge.net/lists/listinfo/ccextractor-users |
From: Hardik C. <har...@ii...> - 2015-03-03 07:10:40
|
Hello This is Hardik Choudhary, a Computer Science undergrad from IIIT Delhi, India. I have looked at the ideas page, and installed CCextractor on my Windows 7. I would love to check out the video sample repository to see where things work and where they go wrong. If I get a feel for how this is working, I will start work on the written tasks. Also can I know how many other people are working on these tasks currently? Thank you very much. Hardik Choudhary BTech CSE 3rd year IIIT Delhi |
From: rohit s. <roh...@gm...> - 2015-02-22 11:11:38
|
Hi. I am new here and want to contribute for this organisation. I know c and python languages. I am also famailer with HTML, CSS, java language. So please someone tell me how and from where to start? On which project i can work? I am also comfortable with windows and ubuntu. |
From: Anshul <ans...@gm...> - 2014-11-03 21:03:50
|
On November 3, 2014 11:00:31 PM IST, Priyank Bhatt <pri...@gm...> wrote: >I am willing to work for CCExtractor . I'm a fourth year undergraduate >Student , I have done my elimentary course on machine learning >algorithm . >I am also familiar with C/C++ and basis Python,HTML,CSS,Javascript . >Please suggest what all do I need to learn or read so as to start >contributing to the open source community of CCExtractor. As an amateur >and >as >per your free slots what project form the idea's page should I work on >? > > >------------------------------------------------------------------------ > >------------------------------------------------------------------------------ > > >------------------------------------------------------------------------ > >_______________________________________________ >ccextractor-users mailing list >cce...@li... >https://lists.sourceforge.net/lists/listinfo/ccextractor-users You should start by downloading code from github https://github.com/CCExtractor/ccextractor There are lots of issue you can see in the github, you can start with any issue you like. Tell me the platform you are comfortable. Is it linux, windows or mac -Anshul -- Sent from my Android device with K-9 Mail. Please excuse my brevity. |
From: Priyank B. <pri...@gm...> - 2014-11-03 17:30:37
|
I am willing to work for CCExtractor . I'm a fourth year undergraduate Student , I have done my elimentary course on machine learning algorithm . I am also familiar with C/C++ and basis Python,HTML,CSS,Javascript . Please suggest what all do I need to learn or read so as to start contributing to the open source community of CCExtractor. As an amateur and as per your free slots what project form the idea's page should I work on ? |
From: Rohit S. <roh...@gm...> - 2014-09-02 11:30:54
|
Thank you. I will. |
From: Carlos F. <cf...@gm...> - 2014-09-02 10:54:02
|
What Willem said :-) Taking care of any issue in github (or implementing anything you are personally interested in) is probably the best idea. On Tue, Sep 2, 2014 at 12:44 PM, Willem Van Iseghem <wil...@gm...> wrote: > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > Hello, > > If you want to contribute, I'd suggest you go to the Github issue lis > (https://github.com/CCExtractor/ccextractor/issues), pick out an issue > and start working on it, and make a pull request afterwards :) > Otherwise you should get in touch with Carlos, who's the main > developer of the program (or wait untill he sees your message on this > list). > > Sincerely, > Willem > (A student from GSoC who worked on CCExtractor) > > On 2/09/2014 11:30, Rohit Shinde wrote: >> Hello all, >> >> I am a CS student studying in India. I came across this >> organisation while browsing the GSOC 2014 accepted organisations >> list. The concept piqued my interest. And now, I would like to >> contribute. >> >> I am quite proficient in C,C++,Java and Python. How can I >> contribute? >> >> Thanks, in advance. >> >> >> ------------------------------------------------------------------------------ >> >> > Slashdot TV. >> Video for Nerds. Stuff that matters. http://tv.slashdot.org/ >> >> >> >> _______________________________________________ ccextractor-users >> mailing list cce...@li... >> https://lists.sourceforge.net/lists/listinfo/ccextractor-users >> > -----BEGIN PGP SIGNATURE----- > Version: GnuPG v1.4.10 (MingW32) > Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/ > > iQEcBAEBAgAGBQJUBZ+GAAoJEER4GtAYK/sNM1sH/1ZwIWCj7AQ77NXrA7pubLHg > zx7ahGEfEGsdFMzaL4+13suoqcb8MsgCoYVAZuYrf6n23KWjNEdSDEwBVSynhx7a > 8ozUY+tZob+Ie/KEXSNkkW6AD7WSV9TI9Ig/lrlaCD7G0Mu10XZsV3hDG/owKMHr > GaDt8pW0cOV6tqxgBWWJX5T+kAbhV1BONBxsmAI+PDc7uWSQCwVd3RhMiK1nCPcK > DxP/wsLncUSvAd9JqcZBvldb+zOnYZ2smdCQnpk0vo/OtrQMUAMrAwD3LQhzWkT9 > yp38vAaZz9qrDjzn3WD6THYQtR6ov6m2Aqb3M5uK9+F76EZpKnuGo3Il1TJqRAI= > =pNXI > -----END PGP SIGNATURE----- > > ------------------------------------------------------------------------------ > Slashdot TV. > Video for Nerds. Stuff that matters. > http://tv.slashdot.org/ > _______________________________________________ > ccextractor-users mailing list > cce...@li... > https://lists.sourceforge.net/lists/listinfo/ccextractor-users |
From: Willem V. I. <wil...@gm...> - 2014-09-02 10:44:38
|
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Hello, If you want to contribute, I'd suggest you go to the Github issue lis (https://github.com/CCExtractor/ccextractor/issues), pick out an issue and start working on it, and make a pull request afterwards :) Otherwise you should get in touch with Carlos, who's the main developer of the program (or wait untill he sees your message on this list). Sincerely, Willem (A student from GSoC who worked on CCExtractor) On 2/09/2014 11:30, Rohit Shinde wrote: > Hello all, > > I am a CS student studying in India. I came across this > organisation while browsing the GSOC 2014 accepted organisations > list. The concept piqued my interest. And now, I would like to > contribute. > > I am quite proficient in C,C++,Java and Python. How can I > contribute? > > Thanks, in advance. > > > ------------------------------------------------------------------------------ > > Slashdot TV. > Video for Nerds. Stuff that matters. http://tv.slashdot.org/ > > > > _______________________________________________ ccextractor-users > mailing list cce...@li... > https://lists.sourceforge.net/lists/listinfo/ccextractor-users > -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.10 (MingW32) Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/ iQEcBAEBAgAGBQJUBZ+GAAoJEER4GtAYK/sNM1sH/1ZwIWCj7AQ77NXrA7pubLHg zx7ahGEfEGsdFMzaL4+13suoqcb8MsgCoYVAZuYrf6n23KWjNEdSDEwBVSynhx7a 8ozUY+tZob+Ie/KEXSNkkW6AD7WSV9TI9Ig/lrlaCD7G0Mu10XZsV3hDG/owKMHr GaDt8pW0cOV6tqxgBWWJX5T+kAbhV1BONBxsmAI+PDc7uWSQCwVd3RhMiK1nCPcK DxP/wsLncUSvAd9JqcZBvldb+zOnYZ2smdCQnpk0vo/OtrQMUAMrAwD3LQhzWkT9 yp38vAaZz9qrDjzn3WD6THYQtR6ov6m2Aqb3M5uK9+F76EZpKnuGo3Il1TJqRAI= =pNXI -----END PGP SIGNATURE----- |
From: Rohit S. <roh...@gm...> - 2014-09-02 09:30:13
|
Hello all, I am a CS student studying in India. I came across this organisation while browsing the GSOC 2014 accepted organisations list. The concept piqued my interest. And now, I would like to contribute. I am quite proficient in C,C++,Java and Python. How can I contribute? Thanks, in advance. |
From: Carlos F. S. <ca...@cc...> - 2014-08-24 09:08:59
|
Looks like a bug indeed... I'm CC'ing the developers list. Someone has been working on .raw lately so probably best if they look into it. Also, you can open a bug ticket in github, that's where we're doing all tracking these days. https://github.com/CCExtractor/ccextractor/issues On Sat, Aug 23, 2014 at 7:24 PM, Brody Hansen <bwh...@sl...> wrote: > Hi Carlos, > > > > First let me say, this looks like a brilliant tool – and it’s great to see > someone showing an active interest in developing an utility that handles > close caption formats so robustly! > > > > I’m just writing to report a potential bug. > > > > My input files are m4v with CEA-608 stream (iTunes). If I output to SRT > it works fine. But attempting to output to RAW / DVDRAW / BIN doesn’t > work. Samples of all output files attached. > > > > Command line: > > C:\Program Files (x86)\CCExtractor\ccextractorwin.exe --gui_mode_reports > -in=mp4 -autoprogram -out=raw -o "C:\somefolder\s01e01_ccextract.raw" > -latin1 --nofontcolor --notypesetting -noteletext [+input files] > > > > The only option I’m changing (other than the output file name) is the –out > argument. I’ve also attached a copy of the CCExtractor GUI *Activity* > output. > > > > Please don’t hesitate to let me know if you would like any further > information. > > > > Kind Regards, > > Brody Hansen. > > > |
From: Carlos F. S. <ca...@cc...> - 2014-08-13 19:16:09
|
Hi, Just noticed this. I'm CC'ing the dev. mailing list and Anshul specifically since he's been working on OCR stuff lately. Most likely he'll be able to look into it. Carlos On Fri, Aug 1, 2014 at 11:57 PM, Bryan Christianson <br...@wh...> wrote: > Hi Carlos > > The problem: Here in NZ (and I believe also in Europe) Freeview TV > transmits dvbs subtitles - i.e embedded images in the transport stream type > 6. I record the stream with EyeTV (Mac PVR software) and then use Plex > Media Server to play it back to AppleTV. Unfortunately, Plex ignores the > dvbs subtitles so I am attempting to extract them with ccextractor, convert > back to text with tesseract and then generate a .srt file which is > recognized by Plex. > > It's mostly working, but I have had to fudge the end time for each > subtitle in the set. The durations in the generated spupng xml file seem to > be very short. > > Here's a sample of the .xml file and you can see that the end time is only > a few 1/10 secs later than the start time. In generating the .srt file I am > applying a heuristic: if(end - start < 2.5) end = start + 2.5 > > I have also noticed that many of the subtitle images are blank - i.e. > contain no text. > > I have put a 1 minute .mpg file on my web server if that will help: > http://whatroute.net/sample.mpg > > A side note - I created an XCode(5.1) project to compile ccextractor if > you are interested. > > Regards > Bryan > > -- > Sample xml generated with: > $EXTRACT -gt --no_progress_bar -autoprogram -streamtype 6 -out=spupng > $stream.mpg -o $TEMP/$stream > > <U+FEFF><subpictures> > <stream> > <!-- 0000000019499b5c.mpg --> > <spu start="0.000" end="0.001" image="" xoffset="88" yoffset="46"> > </spu> > <spu start="2.964" end="3.064" > image="/tmp/subtitles/0000000019499b5c.d/sub0000.png" xoffset="88" > yoffset="46"> > </spu> > <spu start="4.274" end="4.374" > image="/tmp/subtitles/0000000019499b5c.d/sub0001.png" xoffset="452" > yoffset="0"> > </spu> > <spu start="6.384" end="6.484" > image="/tmp/subtitles/0000000019499b5c.d/sub0002.png" xoffset="452" > yoffset="0"> > </spu> > <spu start="7.704" end="7.744" > image="/tmp/subtitles/0000000019499b5c.d/sub0003.png" xoffset="486" > yoffset="0"> > </spu> > <spu start="9.484" end="9.574" > image="/tmp/subtitles/0000000019499b5c.d/sub0004.png" xoffset="452" > yoffset="0"> > </spu> > <spu start="11.214" end="11.264" > image="/tmp/subtitles/0000000019499b5c.d/sub0005.png" xoffset="452" > yoffset="0"> > > > -- > Bryan Christianson > br...@wh... > > > > |
From: Carlos F. <cf...@gm...> - 2014-07-02 18:48:02
|
Let's focus on 0.70 to be released this week :-) If something is not ready now then it won't be in 0.70, that's all. Whatever it is, will have to wait until 0.71. Are all new options explained in the help screen and at least mentioned in CHANGES.TXT? Does the current github build in all platforms? No crashes? All files OK with the test suite or at least nothing that used to work not working now? If yes - then we're ready. If not, let's do whatever is missing. Let's work with a deadline: Friday, July 4th 12:00 UTC (since we're all now in the same side of the Atlantic I think it's a reasonable time of the day). Whatever is missing please make sure to pull-request it by then. I'll prepare 0.70 and release it with the github version. After that, two things can happen: A major screw up is reported by someone and we have to fix and release 0.71 as a bugfix release quickly, or no one reports anything and we continue on the rest of the major items, which to summarize was -off the top of my head- the librarization, the OCR, and the network code. On Wed, Jul 2, 2014 at 7:10 PM, Anshul Maheshwari <ans...@gm...> wrote: > Though I am working on output context of all kind of subtitle, > but I dont know whether It would be good for this release. > Srt or txt files can be generated only after that. > > Binary diff of png image is not a better idea, it will increase testing > time. > and never make sure when both reference and output garbage. > It will also fail when libpng change its compression level or algorithm. > > IMO ( in my opinion ) it would be good if test suite ccextractor can be > build with ocr > support and xml file generated have comments what is written on picture. > then comparing xml files would be a better case. > > -Anshul > > > > On Wed, Jul 2, 2014 at 9:13 PM, Willem Van Iseghem < > wil...@gm...> wrote: > >> >> -----BEGIN PGP SIGNED MESSAGE----- >> Hash: SHA1 >> >> Hi all, >> >> I've ran the tests again, and everything is fine (except for one file >> that seemed broken, but a check revealed it was incorrectly copied). >> >> Reports can be found here: >> http://gsocdev.ccextractor.org/~willem/0.70ReleaseTest/ >> >> Some remarks on the reports: >> - - "Broken" files are the ones that generate bad or incorrect output. >> These were already broken in 0.69 (and might have been broken for ever). >> Most of them have incorrect timings or have DVB subtitles, which can't >> be extracted except when using the spupng option. All info on these can >> be found on the dev server in >> /repository/newRepository/TestFiles/Broken/_Files.txt >> - - DVB folder tests on srt/txt output. I'll discuss with Anshul how we >> could best test output on DVB files. >> - - Errors in General are related to a change in spacing we did or the >> fixes we've applied. >> - - MP4: "Error" is my change adding support for cdt2 atoms >> - - Teletext: Changes due to custom transcripts now. >> - - XDS: Changes in case and small time changes. Checked them all, no >> real issues. >> >> So the current github built, if released now, won't break anything >> that's covered in the tests. >> >> Sincerely, >> Willem >> >> >> >> -----BEGIN PGP SIGNATURE----- >> Version: GnuPG v1.4.10 (MingW32) >> Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/ >> >> iQEcBAEBAgAGBQJTtCiSAAoJEER4GtAYK/sNRkMH/1daOA979JTVJq1aUM+b3UVf >> CsPb6xIXAE4uIWDXu/O3WMuHHJiKW2MQnyRSPyAGqTwkYptEo/B7W1cX1xI/8A0M >> HcUDpMKWki9KMZ++QO/nQ203O1kuMHY6UXbamrtJ/a3i3LdXfc8eFY7KKGsk/tOq >> WKu2B3nuLZm75hNkUpsu/S71z5af3uEOVl8w/HOHy/4WJqCz3Ac+yNTAVFTRlU5u >> gFXrsZFkRrEzopoGtccFVK2VksFb2bPXWhR28MQvQ61tprpu/VFiBkd1vELmZUuI >> sfE0VQmtAoV4F2e68vgSuCgjoLVbf6cdnpTqV48V3kgT4nr2N7AMGftRl3sWmEg= >> =Rccw >> -----END PGP SIGNATURE----- >> >> >> >> ------------------------------------------------------------------------------ >> Open source business process management suite built on Java and Eclipse >> Turn processes into business applications with Bonita BPM Community >> Edition >> Quickly connect people, data, and systems into organized workflows >> Winner of BOSSIE, CODIE, OW2 and Gartner awards >> http://p.sf.net/sfu/Bonitasoft >> _______________________________________________ >> ccextractor-users mailing list >> cce...@li... >> https://lists.sourceforge.net/lists/listinfo/ccextractor-users >> > > > > ------------------------------------------------------------------------------ > Open source business process management suite built on Java and Eclipse > Turn processes into business applications with Bonita BPM Community Edition > Quickly connect people, data, and systems into organized workflows > Winner of BOSSIE, CODIE, OW2 and Gartner awards > http://p.sf.net/sfu/Bonitasoft > _______________________________________________ > ccextractor-users mailing list > cce...@li... > https://lists.sourceforge.net/lists/listinfo/ccextractor-users > > |
From: Anshul M. <ans...@gm...> - 2014-07-02 17:10:40
|
Though I am working on output context of all kind of subtitle, but I dont know whether It would be good for this release. Srt or txt files can be generated only after that. Binary diff of png image is not a better idea, it will increase testing time. and never make sure when both reference and output garbage. It will also fail when libpng change its compression level or algorithm. IMO ( in my opinion ) it would be good if test suite ccextractor can be build with ocr support and xml file generated have comments what is written on picture. then comparing xml files would be a better case. -Anshul On Wed, Jul 2, 2014 at 9:13 PM, Willem Van Iseghem < wil...@gm...> wrote: > > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > Hi all, > > I've ran the tests again, and everything is fine (except for one file > that seemed broken, but a check revealed it was incorrectly copied). > > Reports can be found here: > http://gsocdev.ccextractor.org/~willem/0.70ReleaseTest/ > > Some remarks on the reports: > - - "Broken" files are the ones that generate bad or incorrect output. > These were already broken in 0.69 (and might have been broken for ever). > Most of them have incorrect timings or have DVB subtitles, which can't > be extracted except when using the spupng option. All info on these can > be found on the dev server in > /repository/newRepository/TestFiles/Broken/_Files.txt > - - DVB folder tests on srt/txt output. I'll discuss with Anshul how we > could best test output on DVB files. > - - Errors in General are related to a change in spacing we did or the > fixes we've applied. > - - MP4: "Error" is my change adding support for cdt2 atoms > - - Teletext: Changes due to custom transcripts now. > - - XDS: Changes in case and small time changes. Checked them all, no > real issues. > > So the current github built, if released now, won't break anything > that's covered in the tests. > > Sincerely, > Willem > > > > -----BEGIN PGP SIGNATURE----- > Version: GnuPG v1.4.10 (MingW32) > Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/ > > iQEcBAEBAgAGBQJTtCiSAAoJEER4GtAYK/sNRkMH/1daOA979JTVJq1aUM+b3UVf > CsPb6xIXAE4uIWDXu/O3WMuHHJiKW2MQnyRSPyAGqTwkYptEo/B7W1cX1xI/8A0M > HcUDpMKWki9KMZ++QO/nQ203O1kuMHY6UXbamrtJ/a3i3LdXfc8eFY7KKGsk/tOq > WKu2B3nuLZm75hNkUpsu/S71z5af3uEOVl8w/HOHy/4WJqCz3Ac+yNTAVFTRlU5u > gFXrsZFkRrEzopoGtccFVK2VksFb2bPXWhR28MQvQ61tprpu/VFiBkd1vELmZUuI > sfE0VQmtAoV4F2e68vgSuCgjoLVbf6cdnpTqV48V3kgT4nr2N7AMGftRl3sWmEg= > =Rccw > -----END PGP SIGNATURE----- > > > > ------------------------------------------------------------------------------ > Open source business process management suite built on Java and Eclipse > Turn processes into business applications with Bonita BPM Community Edition > Quickly connect people, data, and systems into organized workflows > Winner of BOSSIE, CODIE, OW2 and Gartner awards > http://p.sf.net/sfu/Bonitasoft > _______________________________________________ > ccextractor-users mailing list > cce...@li... > https://lists.sourceforge.net/lists/listinfo/ccextractor-users > |
From: Willem V. I. <wil...@gm...> - 2014-07-02 15:43:36
|
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Hi all, I've ran the tests again, and everything is fine (except for one file that seemed broken, but a check revealed it was incorrectly copied). Reports can be found here: http://gsocdev.ccextractor.org/~willem/0.70ReleaseTest/ Some remarks on the reports: - - "Broken" files are the ones that generate bad or incorrect output. These were already broken in 0.69 (and might have been broken for ever). Most of them have incorrect timings or have DVB subtitles, which can't be extracted except when using the spupng option. All info on these can be found on the dev server in /repository/newRepository/TestFiles/Broken/_Files.txt - - DVB folder tests on srt/txt output. I'll discuss with Anshul how we could best test output on DVB files. - - Errors in General are related to a change in spacing we did or the fixes we've applied. - - MP4: "Error" is my change adding support for cdt2 atoms - - Teletext: Changes due to custom transcripts now. - - XDS: Changes in case and small time changes. Checked them all, no real issues. So the current github built, if released now, won't break anything that's covered in the tests. Sincerely, Willem -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.10 (MingW32) Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/ iQEcBAEBAgAGBQJTtCiSAAoJEER4GtAYK/sNRkMH/1daOA979JTVJq1aUM+b3UVf CsPb6xIXAE4uIWDXu/O3WMuHHJiKW2MQnyRSPyAGqTwkYptEo/B7W1cX1xI/8A0M HcUDpMKWki9KMZ++QO/nQ203O1kuMHY6UXbamrtJ/a3i3LdXfc8eFY7KKGsk/tOq WKu2B3nuLZm75hNkUpsu/S71z5af3uEOVl8w/HOHy/4WJqCz3Ac+yNTAVFTRlU5u gFXrsZFkRrEzopoGtccFVK2VksFb2bPXWhR28MQvQ61tprpu/VFiBkd1vELmZUuI sfE0VQmtAoV4F2e68vgSuCgjoLVbf6cdnpTqV48V3kgT4nr2N7AMGftRl3sWmEg= =Rccw -----END PGP SIGNATURE----- |
From: Carlos F. S. <ca...@cc...> - 2014-07-01 10:33:36
|
I've applied all pull requests. At least it builds :-) There was a conflict in CHANGES.TXT though, so please recheck in case something is missing. |
From: Ruslan K. <kuc...@gm...> - 2014-06-20 18:31:17
|
Hello all! So I finally got it working. I've created the part that receives captions and "sends" them to the web site. Here how it works: 1)client sends data to server application 2)server writes them to buffer file 3)when you open web site, it send AJAX requests every second to the script which returns unread captions from buffer file. Instead the CCExtractor there is still a program which reads file with captions and sends them in a loop. Buffer file have specified number of lines. Old lines a removed when new ones are added. It's made because otherwise it can grow very large and php would have to read entire file every time. (I didn't implement it using named pipes, they wouldn't work with multiple readers. Maybe I reinvented the wheel.) Sending AJAX requests every seconds (or less) seems to be pretty "real-time". Anyway I don't see another way of implementing it using JavaScript. Maybe Java applet or Flash thing can receive signals from server, but it seems to be difficult. This delays are not critical. I dind't add archiving captions on the server and other important things for now. I need some feedback. Sources and binaries are in my public_html. http://gsocdev.ccextractor.org/~ruslan/ By the way, I need someone to ask for help about PHP and JavaScript. I almost don't have any experience with them. Respectfully, Ruslan Kuchumov. |