Wikia

How To Wiki

How to convert a webpage to readable text

1,795pages on
this wiki
Talk0

You can change a webpage to a readable text file

Web basedEdit

Command lineEdit

You can accomplish this by using lynx

Install lynx, if its not installed already


Grab the text
  • Execute:
    lynx http://www.webpage.org -dump > output-file.txt
  • Example:
    lynx http://www.gentoo.org/doc/en/handbook/handbook-x86.xml?full=1 -dump >handbook-x86.txt
Website2text-websiteLayout

Screenshot of Website used for the following example [1]

Example output
   #[1]Gentoo Website [2]Gentoo Forums [3]Gentoo Bugzilla [4]Gentoo
   Packages [5]Gentoo List Archives

   [6]Gentoo Logo

Gentoo Linux x86 Handbook

   Content:
     * [7]Installing Gentoo
       In this part you learn how to install Gentoo on your system.
         1. [8]About the Gentoo Linux Installation
            This chapter introduces you to the installation approach
            documented in this handbook.
         2. [9]Choosing the Right Installation Medium
            You can install Gentoo in many ways. This chapter explains how
            to install Gentoo using the minimal Installation CD although
            installation through the Installer LiveCD is possible as well.
         3. [10]Configuring your Network
            To be able to download the latest source code, you will need
            to setup networking.
         4. [11]Preparing the Disks
            To be able to install Gentoo, you must create the necessary
            partitions. This chapter describes how to partition a disk for

GraphicalEdit

to be added




From HowTo Wiki, a Wikia wiki.

Around Wikia's network

Random Wiki