ThreadGet
This PET worked in FATDOG630
donwloaded and installed in a pristine FATDOG630. Works! Thanks.
Spaced Out?
G'day again Flash,
I tried using a file name with a space (as per your posted image with a space in the file name for the htm file) and got nothing in ForumThreads directory.
Using a space-free file name and I got the desired threads file.
It's an old .pet and maybe needs an update to be able to include spaces in the file name?
David S.
I tried using a file name with a space (as per your posted image with a space in the file name for the htm file) and got nothing in ForumThreads directory.
Using a space-free file name and I got the desired threads file.
It's an old .pet and maybe needs an update to be able to include spaces in the file name?
David S.
broken links
No answer to question
Last edited by tlchost on Mon 10 Feb 2014, 07:55, edited 2 times in total.
Thanks davids45, I hadn't even thought about the space in the filename. I'll give it a try without a space and report back, later.
Later: You were right. When I used an underscore instead of a space, ThreadGet downloaded three pages of a thread from the forum and put them in /root/ForumThreads, in a file I told it to name ThreadGet_test.
Tclhost: how did you open the html file? When I click on it, it opens in SeaMonkey but the links in the file don't seem to work for me either.
Later: You were right. When I used an underscore instead of a space, ThreadGet downloaded three pages of a thread from the forum and put them in /root/ForumThreads, in a file I told it to name ThreadGet_test.
Tclhost: how did you open the html file? When I click on it, it opens in SeaMonkey but the links in the file don't seem to work for me either.
Hi davids45 - yep the download of the program worked, and it is running happily, have run some successful tests so far.
HOWEVER - I am looking at using this as a second stage to HTTracker on forum posts hundreds of pages long.
My method is three stages;
1 - Download the original thread to local, (scraping as much out as possible) - also looking at downloading printer versions of threads to strip out the bulk of the html code.
2 - and strip out as much html info as possible, leaving just the posts with related post date/time, poster name, etc;
3 - Put the scraped pages into one html file.
The main difficulty I am having is in running ThreadGet on locally downloaded websites; they maintain the relative file/folder structure, but the html names of the files have changed in the downloaded version.
When ThreadGet runs on the local html files, it 'ghosts' the dialogue boxes on the screen and goes through the motions, and the result is a blank html file.
Apologies if this sounds a bit vague, I am experimenting on the go, I can give more detail or better descriptions of what is happening.
I wondered if anyone had any issues like this when running ThreadGet on locally stored web pages?
Love the app btw,
@tlchost - I just wondered on your post - if it has merged all thread pages into one html file then presumably the links to page## and next, last etc will not work as they are now referring to other external pages... would this not require converting those links to # bookmarks referring into the same html page?
Thanks for this cracking little app!
HOWEVER - I am looking at using this as a second stage to HTTracker on forum posts hundreds of pages long.
My method is three stages;
1 - Download the original thread to local, (scraping as much out as possible) - also looking at downloading printer versions of threads to strip out the bulk of the html code.
2 - and strip out as much html info as possible, leaving just the posts with related post date/time, poster name, etc;
3 - Put the scraped pages into one html file.
The main difficulty I am having is in running ThreadGet on locally downloaded websites; they maintain the relative file/folder structure, but the html names of the files have changed in the downloaded version.
When ThreadGet runs on the local html files, it 'ghosts' the dialogue boxes on the screen and goes through the motions, and the result is a blank html file.
Apologies if this sounds a bit vague, I am experimenting on the go, I can give more detail or better descriptions of what is happening.
I wondered if anyone had any issues like this when running ThreadGet on locally stored web pages?
Love the app btw,
@tlchost - I just wondered on your post - if it has merged all thread pages into one html file then presumably the links to page## and next, last etc will not work as they are now referring to other external pages... would this not require converting those links to # bookmarks referring into the same html page?
Thanks for this cracking little app!
May I ask a favor? I'm on actually three forums, and one of them is migrating to phpBB from YaBB (specifically YaBB 1 Gold, Service Pack 1.1). I realize that this YaBB version is particularly antiquated at best (one of the main reasons for the move) BUT could ThreadGet be modified somehow to support it? I'd love to help them with the move, and being able to preserve the old posts is more than a little important.
Google tells me that YaBB is based on Perl, rather than PHP.
The forum in question is here --> http://forum.psion2.org/YaBB.pl
Google tells me that YaBB is based on Perl, rather than PHP.
The forum in question is here --> http://forum.psion2.org/YaBB.pl
-
- Posts: 503
- Joined: Mon 09 Sep 2013, 00:00
- Location: Florida, USA
- Contact:
I was thinking of using ThreadGet or 'Mozilla Archive Format' to make a backup of some threads and host them on my site.
This is for when murga-linux.com goes down, as it does occasionally.
Is this allowed?
Mozilla Archive Format (a browser extension)
http://maf.mozdev.org/
http://en.wikipedia.org/wiki/Mozilla_Archive_Format
This is for when murga-linux.com goes down, as it does occasionally.
Is this allowed?
Mozilla Archive Format (a browser extension)
http://maf.mozdev.org/
http://en.wikipedia.org/wiki/Mozilla_Archive_Format
Hey guys, apologies for the delay I have been busy with other things!
However, I am returning to my data projects, and your help and assistance would be invaluable!
The first issue I am having is threadget is making multiple copies of the first html page.
So I set first page as page 1, and number of pages as 7, and I get 7 versions of page 1 concatenated!
Also, when I run Threadget on the same website locally, it results in a blank file.
Any ideas?
In the meantime I shall do a bit more testing, and report back the results.
However, I am returning to my data projects, and your help and assistance would be invaluable!
The first issue I am having is threadget is making multiple copies of the first html page.
So I set first page as page 1, and number of pages as 7, and I get 7 versions of page 1 concatenated!
Also, when I run Threadget on the same website locally, it results in a blank file.
Any ideas?
In the meantime I shall do a bit more testing, and report back the results.
a good pet
Just tried this .pet for the first time tonight using slacko 5.6 and it worked SO well. Thank you so much! This is such a helpful thing for me.
me too
me too