If you also want the images offline, look at the XOWA project. If you want a complete mirror of Wikipedia (including images), with full HTML formatting intact, that will download in approximately 30 hours, XOWA is the tool to use.

XOWA is a free and open-source application written primarily in Java by anonymous developers and is intended for users who wish to run their own copy of Wikipedia, or any other compatible wiki, offline without an internet connection. XOWA is compatible with Microsoft Windows, macOS, Linux and Android.

English Wikipedia has a lot of data. There are 13.9+ million pages with 20.0+ GB of text, as well as 3.7+ million thumbnails. Setting all this up on your computer will not be a quick process. The import itself will require 80GB of disk space and five hours of processing; with images, the numbers increase to 100GB of disk space and 30 hours of processing. However, when you are done, you will have a complete, recent copy of English Wikipedia with images that can fit on a 128GB SD card.

Alternatively, you can use Kiwix, which is faster to set up because it also provides already processed dumps (ZIM files). The experience is very similar to the browser experience. As the comment specifies, mwoffliner can be used to bring other MediaWiki sites into Kiwix. It may not work with all of them, since they may have custom differences, but it is the only variant I came across.

Taking Wikimedia content with wget is not good practice. If too many people did that, it could flood the sites with requests.

There is also a guide and set of tools for running a full offline mirror of Wikipedia with three different approaches: an Nginx caching proxy, Kiwix with a ZIM dump, and MediaWiki/XOWA with an XML dump.
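Before starting a long XOWA import, it may be worth checking that the target drive has room for it. The following is a minimal Python sketch, not part of XOWA itself. It assumes the 80GB figure quoted above applies to a text-only import and the 100GB figure to an import with images; the function names and the target path are hypothetical, chosen only for illustration.

```python
import shutil

# Figures quoted in the text; treated here as rough guidance (assumption:
# 80 GB is for text only, 100 GB is for text plus images).
TEXT_ONLY_GB = 80
WITH_IMAGES_GB = 100


def required_gb(with_images: bool) -> int:
    """Return the quoted disk space, in GB, for the chosen kind of import."""
    return WITH_IMAGES_GB if with_images else TEXT_ONLY_GB


def has_enough_space(path: str, needed_gb: float) -> bool:
    """Return True if the filesystem holding `path` has at least `needed_gb` free."""
    free_bytes = shutil.disk_usage(path).free
    return free_bytes >= needed_gb * 10**9


if __name__ == "__main__":
    target = "."  # hypothetical location for the offline wiki data
    needed = required_gb(with_images=True)
    if has_enough_space(target, needed):
        print(f"OK: at least {needed} GB free under {target!r}")
    else:
        print(f"Not enough space: {needed} GB needed under {target!r}")
```

The check uses decimal gigabytes (10^9 bytes), which is how storage such as a 128GB SD card is usually labeled.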
You can take the compressed article dump from the Wikimedia dumps site and process it with WikiTaxi (the download link is in the upper left corner of its site). The import produces a .taxi file (around 15 GB for Wikipedia) out of the dump. That file is then used by the WikiTaxi program to search through articles.

Other notable projects, besides XOWA, WikiTaxi and Kiwix mentioned already, are:

Aard Dictionary, a free software dictionary with binaries for Windows, Linux, Mac and Android. It can build its own offline data from Wikimedia dumps or XDXF files with the help of Aard Tools (Aard Tools needs Linux/Unix and Python 2.7). Aard Dictionary has many of the Wikimedia projects available pre-built in many languages, as well as other dictionaries such as WordNet. Compared with what Kiwix does when you first load an offline data file, it is also much faster, and it searches all the available dictionaries at the same time, displaying all the matching terms in separate tabs.

Okawix, which works on Windows, Linux, Mac and Android. It cannot build its own offline data, but many of the Wikimedia projects are available pre-built in many languages.

FastWiki, which works on Windows, Linux and Android. It can build its own offline data from dumps or ZIM files.

From the Kiwix side: "We sometimes hear from alternatives to Kiwix, but more often than not they fly completely under our radar, and there are plenty of good lessons we miss. I know of XOWA, for instance (though the main developer seems to have moved on and has not worked on the code since March), and someone recently flagged GoldenDict for Wiktionary files on our GitHub repo."

Some of the main use cases I've seen for projects such as Kiwix and XOWA are to make educational and informational content available in areas with poor or no internet connection (there's a story of a carrier pigeon carrying a 4 GB memory stick across South Africa in the time it took the regular internet connection to transfer 4% of the data).
I was wondering if there is any relationship between XOWA databases and the Kiwix ZIM files. XOWA does not use the ZIM file format.

XOWA can mirror not only Wikipedia (including images) but also Wiktionary, Wikiquote, etc., with full HTML formatting intact. The offline version is very much like the online version and includes photos (I tested this completely offline).