TextNet Acknowledgements

TextNet is a free online tool for creating text networks. You can either insert plain text or upload a PDF document and TextNet will turn it into a network. You have to name your project so you can easily download it later. This page acknowledges the contributions either direct or indirect to this project

TextNet

Dissertation Supervisors

This project was created during the completion of a Masters Degree at the University of Cape Town. My dissertation supervisors are listed below

  • Dr. Ian Durbach
  • Dr. Melvin Varughese
  • Dr. Bruce Basset

Libraries and Software Used

This project made use of various open source software packages and libraries, they are listed below. The limitations of these software packages apply to TextNet and it is best to track these limitations through the software packages directly.

Limitations and Errors

  • The PDF Parser can fail. This is indicated by some error that usually stops TextNet. This can happen for various reasons. The most important is that the PDF's features are not supported. An example is encryption. If the PDF file is large or might have special features it is preferable to use an online PDF parser or copy the text and use the text field on the home page.
  • The GEXF file can be ill formed. After the plain text or PDF have been converted into a grph the graph is converted to a GEXF file. If this file is ill formed the visualisation willl not appear. This happens for various reasons the most common is unsupported characters. This issue can be resolved by finding the unsupported character, which includes finding the file.
    • To find the file change the url in the browser from visualisation.php?resource=YOUR_PROJECT_NAME to files/YOUR_PROJECT_NAME/graph.gexf. If there is a problem with the GEXF file the statment This page contains the following errors: will appear. Beneath this statement an error and line number will be provided. You can view the file by right clicking and selecting view source on most browsers. It is typically faster however to right click and select save as which will intitate a file download.
    • Once the file is found the unsupported characters can be removed. These characters typically include
      • The ampersand &
      • Any accented character like à, è, ì, ò, ù . There character can eb found here
      Once the character are removed the file can be opened in Gephi for visualisation. Gephi is likely to produce some errors but if ignored the process should proceed.
  • If there are any errors not address here please email me and I will try to address them as soon as possible.