Wiki site PDF dump?
Does anyone know how I can make an offline version of the TOH wiki, so that I can use the wiki information offline?
PS: Yes, there are homes in UK that have no internet ... :sulk:
will look into that as part of updating and improving BIKI ... no ETA atm.
but thanks for pointing it out ...
Nov 8 2011, 20:35
You can get the lite version of the COMREF which is pretty much the same thing.
On the wiki page just press 'Ctrl' + 'S' and save it as a webpage (HTML or something) for viewing offline.
There is a tool called HTTrack (http://www.httrack.com/) that downloads whole sites for offline browsing. But you should be careful with this tool, or you might download… THE WHOLE INTERNETZ!!!!1
Just kidding. :)
Dwarden, the fact is that the system seems to be already developed (I'm not sure how good it is), but it has to be run on the server side. See: http://www.mediawiki.org/wiki/Extension:Pdf_Export
The only downside of this method is that it "only" goes down one link level. That would actually be good, though, as it limits the bandwidth / server CPU and memory required, and it would be more than enough to capture the massive: http://community.bistudio.com/wiki/Category:Scripting_Commands_Take_On_Helicopters
Anyway, as usual, thanks for taking our feedback into account.
Tankbuster, I know that PDF file, and part of the reason I made that post is because I was wondering how those guys did it... I did some pretty extensive research and found many ways to do that on Wikipedia (apparently they dump their XML database regularly for mirroring purposes), but not a single one that works in "MediaWiki". Thanks anyway for your spot-on answer.
konyo, while you are correct, you would need many hours and a lot of patience to download every single page and rework the (roughly) 1300 links that relate to TOH... your proposal doesn't look feasible to me, but thanks.
fboes, I know this tool as well as WGET and some others. Have you had any experience trying to download a wiki site? (This is not a rhetorical question.) On the other hand, I'm not sure this is an acceptable method (this is worse than a robot/web crawler). Yes, I know you can limit the assigned bandwidth, but it is still a systematic dump... chances are that a firewall gets upset with so many repeated hits from the same IP... Thanks
We were working with HTTrack to do an offline version of a whole internet magazine (you had to submit your site as offline cd-rom for a contest… don't ask… :) ). We set it up to play nice with the servers (because there was normal traffic), only allowed it to go down to a certain level, and told it to be really slow.
So it can be done - but you have to be really careful with your settings. The best way of doing this is to ask permission, and do it in low-traffic hours (like 2 am).
And it is far less brutal than a Google bot swamping you with 10,000 requests in a matter of seconds. :)
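For anyone wondering what "play nice with the servers" could look like in practice, here is a rough Python sketch of the same idea: stay on one host, only go down a fixed number of link levels, and wait between requests. This is not how HTTrack works internally and not something anyone in this thread actually ran; the names `polite_crawl`, `fetch_page`, `max_depth` and `delay_seconds` are made up for illustration, and the 5-second default delay is just an assumed "really slow" value. It also does not check robots.txt, so asking permission first still applies.

```python
import time
import urllib.request
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse


class LinkParser(HTMLParser):
    """Collects the href of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def fetch_page(url):
    """Download one page over HTTP and return it as text."""
    with urllib.request.urlopen(url, timeout=30) as response:
        return response.read().decode("utf-8", errors="replace")


def polite_crawl(start_url, max_depth=1, delay_seconds=5.0,
                 fetch=fetch_page, sleep=time.sleep):
    """Breadth-first crawl (hypothetical helper): stays on the start host,
    stops following links at max_depth, pauses between requests.
    Returns a dict mapping each visited URL to its HTML."""
    host = urlparse(start_url).netloc
    pages = {}
    seen = {start_url}
    queue = deque([(start_url, 0)])
    while queue:
        url, depth = queue.popleft()
        if pages:
            sleep(delay_seconds)  # wait before every request after the first
        html = fetch(url)
        pages[url] = html
        if depth >= max_depth:
            continue  # deep enough: keep the page, do not follow its links
        parser = LinkParser()
        parser.feed(html)
        for href in parser.links:
            # Resolve relative links and drop "#fragment" parts.
            link = urldefrag(urljoin(url, href)).url
            if urlparse(link).netloc == host and link not in seen:
                seen.add(link)
                queue.append((link, depth + 1))
    return pages
```

With `max_depth=1` and a category page as `start_url`, this would fetch the category page plus every same-host page it links to, one request every `delay_seconds`. The `fetch` and `sleep` parameters are only there so the function can be tried against a fake site without touching a real server.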
Isaac, that extension is obsolete and beta. I aim first for extensions that are stable, or actively developed and tested as working ...
fboes, well, it is always good to learn new things. If I finally decide to go ahead with this I will ask first ... Oh, wait a second, :butbut: now I can foresee some extra issues I didn't realize at the beginning. Let's say I finally get an HTML render dump of the wiki database... ok, so now I have an offline copy of the same information formatted in the very same way. In other words, I solved my lack of connection at home, but I just made an offline port of the contents... (Ok, I have to give you that, you solved 50% of the problem :p).
The reason I started this thread is to have a kind of structured PDF, so that I can print the wiki FPDR and have that information, black on white, when I'm in the editor, because I don't think it is practical to have more applications running in parallel...
Dwarden, thanks for the link, I didn't realize it was in beta status... :o Sorry about that.
[Me thinking out loud] How the heck did those guys make that wonderful PDF (http://www.ofpec.com/ed_depot/index.php?action=details&id=391&game=ArmA)? Going link by link? :386: