Puppy Linux Discussion Forum
Puppy HOME page : puppylinux.com
"THE" alternative forum : puppylinux.info
 

The time now is Tue 02 Sep 2014, 05:14
All times are UTC - 4
ThreadGet
Page 3 of 3 [41 Posts]   Goto page: Previous 1, 2, 3
gcmartin

Joined: 14 Oct 2005
Posts: 4238
Location: Earth

PostPosted: Tue 04 Feb 2014, 01:37    Post subject: This PET worked in FATDOG630  

Downloaded and installed in a pristine FATDOG630. Works! Thanks.
_________________
Get ACTIVE Create Circles; Do those good things which benefit people's needs!
We are all related ... Its time to show that we know this!
3 Different Puppy Search Engine or use DogPile
davids45


Joined: 26 Nov 2006
Posts: 647
Location: Chatswood, NSW

PostPosted: Tue 04 Feb 2014, 03:30    Post subject: Spaced Out?  

G'day again Flash,

I tried a file name containing a space (like the name of the htm file in your posted image) and got nothing in the ForumThreads directory.

With a space-free file name, I got the desired threads file.

It's an old .pet, so maybe it needs an update to handle spaces in the file name?
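
One possible explanation, assuming ThreadGet is a shell script that passes the file name along unquoted (nothing in this thread confirms that), is that the shell splits a name containing a space into two separate arguments. The Python sketch below imitates that splitting with `shlex.split` and shows the underscore workaround. `safe_name` is a hypothetical helper, not part of ThreadGet.

```python
import shlex

def safe_name(name):
    """Replace each run of whitespace with one underscore (hypothetical helper)."""
    return "_".join(name.split())

# Unquoted, a name with a space is split into two words,
# the way a POSIX shell would split it.
unquoted = shlex.split("ThreadGet test")

# Quoted, the same name survives as a single word.
quoted = shlex.split(shlex.quote("ThreadGet test"))

print(unquoted)
print(quoted)
print(safe_name("ThreadGet test"))
```

If the script does turn out to use unquoted variables, quoting them would be the real fix; renaming with underscores only avoids the problem.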

David S.
tlchost

Joined: 05 Aug 2007
Posts: 1681
Location: Baltimore, Maryland USA

PostPosted: Tue 04 Feb 2014, 08:13    Post subject: broken links  

No answer to question
Last edited by tlchost on Mon 10 Feb 2014, 03:55; edited 2 times in total
Flash
Official Dog Handler


Joined: 04 May 2005
Posts: 11034
Location: Arizona USA

PostPosted: Tue 04 Feb 2014, 09:28    Post subject:  

Thanks davids45, I hadn't even thought about the space in the filename. :oops: I'll give it a try without a space and report back later.

Later: You were right. When I used an underscore instead of a space, ThreadGet downloaded three pages of a thread from the forum and put them in /root/ForumThreads, in a file I told it to name ThreadGet_test. :D

tlchost: how did you open the html file? When I click on it, it opens in SeaMonkey, but the links in the file don't seem to work for me either.
count

Joined: 30 Jan 2014
Posts: 5

PostPosted: Tue 04 Feb 2014, 20:48    Post subject:  

Hi davids45 - yep, the download of the program worked and it is running happily; I have run some successful tests so far.

HOWEVER - I am looking at using this as a second stage after HTTrack on forum threads hundreds of pages long.

My method has three stages:
1 - Download the original thread to local storage, scraping out as much as possible. I am also looking at downloading the printer-friendly versions of threads, which carry much less html code.
2 - Strip out as much html as possible, leaving just the posts with their date/time, poster name, etc.
3 - Put the scraped pages into one html file.
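
Stages 2 and 3 above could be sketched roughly as follows in Python. This is only an illustration under simple assumptions: it strips every tag and keeps the visible text, because the thread does not describe the forum's markup well enough to pick out individual posts. The names `TextOnly`, `strip_html` and `merge_pages` are made up for this example.

```python
from html import escape
from html.parser import HTMLParser

class TextOnly(HTMLParser):
    """Collect visible text, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip and data.strip():
            self.parts.append(data.strip())

def strip_html(page):
    """Stage 2: reduce one downloaded page to its visible text."""
    parser = TextOnly()
    parser.feed(page)
    parser.close()
    return "\n".join(parser.parts)

def merge_pages(pages):
    """Stage 3: build one html document with one section per source page."""
    sections = [
        f'<h2 id="page-{n}">Page {n}</h2>\n<pre>{escape(strip_html(p))}</pre>'
        for n, p in enumerate(pages, start=1)
    ]
    return "<html><body>\n" + "\n".join(sections) + "\n</body></html>"
```

Keeping poster names and dates as separate fields would need selectors written for the specific forum software, which this sketch does not attempt.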

The main difficulty I am having is in running ThreadGet on locally downloaded websites; they maintain the relative file/folder structure, but the html names of the files have changed in the downloaded version.

When ThreadGet runs on the local html files, it 'ghosts' the dialogue boxes on the screen and goes through the motions, and the result is a blank html file.

Apologies if this sounds a bit vague; I am experimenting on the go, and I can give more detail or better descriptions of what is happening.

I wondered if anyone had any issues like this when running ThreadGet on locally stored web pages?

Love the app btw :D

@tlchost - I was wondering about your post: if ThreadGet has merged all the thread pages into one html file, then presumably the links to page##, next, last, etc. will not work, as they now refer to other, external pages. Would this not require converting those links to # bookmarks pointing into the same html page?
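
A rough Python sketch of that link conversion is below. It rests on several assumptions that are not confirmed in this thread: that pagination links look like `viewtopic.php?...start=N` (phpBB style), that there are 15 posts per page, and that each page section in the merged file carries an id such as `page-3`. The names `PAGE_LINK` and `localize_links` are hypothetical.

```python
import re

POSTS_PER_PAGE = 15  # assumption; depends on the forum's settings

# Matches href="...viewtopic.php?...start=N..." (assumed phpBB-style link).
PAGE_LINK = re.compile(r'href="[^"]*viewtopic\.php\?[^"]*?start=(\d+)[^"]*"')

def to_local_anchor(match):
    """Turn a post offset into a link to the matching in-file section."""
    page = int(match.group(1)) // POSTS_PER_PAGE + 1
    return f'href="#page-{page}"'

def localize_links(html):
    """Point pagination links at in-file bookmarks instead of the live forum."""
    return PAGE_LINK.sub(to_local_anchor, html)
```

Links without a `start` value (typically page 1) are left untouched here and would need their own rule.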

Thanks for this cracking little app!
Flash
Official Dog Handler


Joined: 04 May 2005
Posts: 11034
Location: Arizona USA

PostPosted: Tue 04 Feb 2014, 21:02    Post subject:  

Count: downloading a thread that is hundreds of pages long will slow down the forum's server for everyone else while you're doing it. Please consider that while you're experimenting. :)
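
One common way to reduce that load is to pause between page requests. The Python sketch below shows the idea only; it says nothing about how ThreadGet itself fetches pages. The function name `fetch_politely` and the 5 second default are arbitrary choices for illustration.

```python
import time

def fetch_politely(urls, fetch, delay_seconds=5.0, sleep=time.sleep):
    """Fetch each URL in turn, pausing between requests.

    `fetch` is any callable that takes a URL and returns the page text.
    It is passed in so the pacing can be tried without touching a network.
    """
    pages = []
    for i, url in enumerate(urls):
        if i > 0:
            sleep(delay_seconds)  # pause before every request after the first
        pages.append(fetch(url))
    return pages
```

For real downloads, `fetch` could be something like `lambda u: urllib.request.urlopen(u).read().decode(errors="replace")`, and running the job at a quiet time of day helps as well.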
count

Joined: 30 Jan 2014
Posts: 5

PostPosted: Fri 07 Feb 2014, 19:13    Post subject:    

Flash wrote:
Count: downloading a thread that is hundreds of pages long will slow down the forum's server for everyone else while you're doing it. Please consider that while you're experimenting. :)


Hi Flash, yes, thank you, I was aware of that too! It wasn't this forum btw :)
starhawk

Joined: 22 Nov 2010
Posts: 2844
Location: Everybody knows this is nowhere...

PostPosted: Sun 16 Feb 2014, 18:32    Post subject:  

May I ask a favor? I'm actually on three forums, and one of them is migrating to phpBB from YaBB (specifically YaBB 1 Gold, Service Pack 1.1). I realize that this YaBB version is antiquated at best (one of the main reasons for the move), BUT could ThreadGet be modified somehow to support it? I'd love to help them with the move, and being able to preserve the old posts is more than a little important.

Google tells me that YaBB is based on Perl, rather than PHP.

The forum in question is here --> http://forum.psion2.org/YaBB.pl

_________________
Loving X-Slacko 2.1!
Custom Build: HP MOCA-AR + Core2Duo T7200 + 4gb RAM + 256gb SSD
...just needs a pretty case ;)
version2013

Joined: 08 Sep 2013
Posts: 78

PostPosted: Mon 23 Jun 2014, 13:45    Post subject:  

I was thinking of using ThreadGet or 'Mozilla Archive Format' to make a backup of some threads and host them on my site.
This is for when murga-linux.com goes down, as it does occasionally.

Is this allowed?


Mozilla Archive Format (a browser extension)
http://maf.mozdev.org/
http://en.wikipedia.org/wiki/Mozilla_Archive_Format
Flash
Official Dog Handler


Joined: 04 May 2005
Posts: 11034
Location: Arizona USA

PostPosted: Mon 23 Jun 2014, 16:29    Post subject:  

Yes, with proper attribution.
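
What counts as "proper attribution" is not spelled out here. One reasonable reading is a visible note on each hosted copy that names the original thread, the posters, and the retrieval date. The Python sketch below adds such a note; the wording, the function names, and the example.org URL used to try it are all assumptions.

```python
import re
from datetime import date
from html import escape

BODY_TAG = re.compile(r"<body\b[^>]*>", re.IGNORECASE)

def attribution_banner(source_url, authors, retrieved):
    """Return an html note crediting the original thread and its posters."""
    names = ", ".join(escape(a) for a in authors)
    url = escape(source_url, quote=True)
    return (
        '<p class="attribution">'
        f'Archived copy of <a href="{url}">{url}</a>, '
        f"retrieved {retrieved.isoformat()}. "
        f"Posts by: {names}."
        "</p>"
    )

def add_attribution(html, banner):
    """Insert the banner right after <body>, or prepend it if there is no <body>."""
    match = BODY_TAG.search(html)
    if match is None:
        return banner + html
    return html[: match.end()] + banner + html[match.end():]

banner = attribution_banner(
    "http://example.org/viewtopic.php?t=1&start=30",  # placeholder URL
    ["Flash", "count"],
    date(2014, 6, 23),
)
```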
count

Joined: 30 Jan 2014
Posts: 5

PostPosted: Wed 06 Aug 2014, 15:29    Post subject:  

Hey guys, apologies for the delay; I have been busy with other things!

However, I am returning to my data projects, and your help and assistance would be invaluable!

The first issue I am having is that ThreadGet is making multiple copies of the first html page.

If I set the first page to 1 and the number of pages to 7, I get 7 copies of page 1 concatenated!

Also, when I run ThreadGet on a locally saved copy of the same website, the result is a blank file.
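
One plausible cause, and it is only a guess because ThreadGet's code is not shown in this thread, is that the per-page part of the URL never changes, so the same page is requested seven times. phpBB-style forums usually select a page by a post offset rather than a page number. The Python sketch below builds such offsets and flags repeated pages; the `start` parameter name, the 15 posts per page, and the example.org URL are assumptions.

```python
import hashlib

def page_urls(base_url, first_page, page_count, posts_per_page=15):
    """Build one URL per page using a post offset (assumed `start` parameter)."""
    urls = []
    for page in range(first_page, first_page + page_count):
        offset = (page - 1) * posts_per_page
        urls.append(f"{base_url}&start={offset}")
    return urls

def duplicate_pages(pages):
    """Return 1-based indexes of pages whose text repeats an earlier page."""
    seen = set()
    repeats = []
    for index, text in enumerate(pages, start=1):
        digest = hashlib.sha256(text.encode("utf-8")).hexdigest()
        if digest in seen:
            repeats.append(index)
        seen.add(digest)
    return repeats

urls = page_urls("http://example.org/viewtopic.php?t=1", 1, 3)
```

An exact hash comparison may miss duplicates on live pages, since things like the current time in the page header differ between requests; comparing only the post text would be more robust. A forum that uses a different URL scheme would need a different `page_urls`.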

Any ideas?

In the meantime I shall do a bit more testing, and report back the results.