NLTK Data installation issues

2024/9/27 20:23:38

I am trying to install NLTK Data on Mac OS X 10.9. According to the NLTK 3.0 documentation, the download directory for a central installation should be /usr/share/nltk_data. But with this path I get the error OSError: [Errno 13] Permission denied: '/usr/share/nltk_data'

Can I set the download directory to /Users/ananya/nltk_data for a central installation?

I have Python 2.7 installed on my machine.

Thanks, Ananya

Answer

Have you tried:

$ sudo python
>>> import nltk
>>> nltk.download()
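
If you don't have root access (or would rather not use sudo), a per-user download works too: NLTK searches ~/nltk_data by default, so a directory like /Users/ananya/nltk_data needs no extra configuration. A minimal sketch (downloading only 'wordnet' here is just an example):

>>> import nltk
>>> # download a single corpus into a user-writable directory
>>> nltk.download('wordnet', download_dir='/Users/ananya/nltk_data')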

To check that the download worked, try loading a few of the corpora you downloaded, e.g.:

>>> from nltk.corpus import wordnet
>>> wordnet.synsets('dog')
[Synset('dog.n.01'), Synset('frump.n.01'), Synset('dog.n.03'), Synset('cad.n.01'), Synset('frank.n.02'), Synset('pawl.n.01'), Synset('andiron.n.01'), Synset('chase.v.01')]

If the corpora are not installed properly, you will see something like this:

Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/local/lib/python2.7/dist-packages/nltk/corpus/util.py", line 68, in __getattr__
    self.__load()
  File "/usr/local/lib/python2.7/dist-packages/nltk/corpus/util.py", line 56, in __load
    except LookupError: raise e
LookupError:
**********************************************************************
  Resource 'corpora/wordnet' not found.  Please use the NLTK
  Downloader to obtain the resource:  >>> nltk.download()
  Searched in:
    - '/home/alvas/nltk_data'
    - '/usr/share/nltk_data'
    - '/usr/local/share/nltk_data'
    - '/usr/lib/nltk_data'
    - '/usr/local/lib/nltk_data'
**********************************************************************
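
Note the "Searched in:" list above: if your data lives in a directory that is not on that list, you can point NLTK at it before loading a corpus. A minimal sketch, assuming the user directory from the question (setting the NLTK_DATA environment variable has the same effect):

>>> import nltk
>>> # add the custom directory to NLTK's data search path
>>> nltk.data.path.append('/Users/ananya/nltk_data')
>>> from nltk.corpus import wordnet
>>> wordnet.synsets('dog')  # should now resolve against the custom path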