Questions tagged [stanford-nlp]
Questions on the open source natural language processing software from the Stanford University NLP Group, in Java, Python, and C, including Stanford CoreNLP, Stanza, and GloVe.
3,474
questions
186
votes
10
answers
105k
views
Java Stanford NLP: Part of Speech labels?
The Stanford NLP, demo'd here, gives an output like this:
Colorless/JJ green/JJ ideas/NNS sleep/VBP furiously/RB ./.
What do the Part of Speech tags mean? I am unable to find an official list. Is it ...
99
votes
18
answers
113k
views
How to use Stanford Parser in NLTK using Python
Is it possible to use Stanford Parser in NLTK? (I am not talking about Stanford POS.)
93
votes
3
answers
4k
views
How to train the Stanford Parser with Genia Corpus?
I have some problems to create a new model for Stanford Parser.
I have also downloaded the last version from Stanford:
http://nlp.stanford.edu/software/lex-parser.shtml
And here, Genia Corpus in 2 ...
34
votes
8
answers
17k
views
NLTK vs Stanford NLP
I have recently started to use NLTK toolkit for creating few solutions using Python.
I hear a lot of community activity regarding using Stanford NLP.
Can anyone tell me the difference between NLTK and ...
30
votes
10
answers
36k
views
Stanford nlp for python
All I want to do is find the sentiment (positive/negative/neutral) of any given string. On researching I came across Stanford NLP. But sadly its in Java. Any ideas on how can I make it work for python?...
29
votes
3
answers
7k
views
How can a tree be encoded as input to a neural network?
I have a tree, specifically a parse tree with tags at the nodes and strings/words at the leaves. I want to pass this tree as input into a neural network all the while preserving its structure.
...
28
votes
12
answers
37k
views
How can I split a text into sentences using the Stanford parser?
How can I split a text or paragraph into sentences using Stanford parser?
Is there any method that can extract sentences, such as getSentencesFromString() as it's provided for Ruby?
28
votes
3
answers
20k
views
Ease of use: Stanford CoreNLP vs. OpenNLP [closed]
I looking to use a suite of NLP tools for a personal project, and I was wondering whether Stanford's CoreNLP is easier to use or OpenNLP. Or is there another free package you would reccomend?
I haven'...
28
votes
3
answers
17k
views
How to detect that two sentences are similar?
I want to compute how similar two arbitrary sentences are to each other. For example:
A mathematician found a solution to the problem.
The problem was solved by a young mathematician.
I ...
28
votes
5
answers
30k
views
How to Train GloVe algorithm on my own corpus
I tried to follow this.
But some how I wasted a lot of time ending up with nothing useful.
I just want to train a GloVe model on my own corpus (~900Mb corpus.txt file).
I downloaded the files provided ...
28
votes
3
answers
18k
views
Is it possible to train Stanford NER system to recognize more named entities types?
I'm using some NLP libraries now, (stanford and nltk)
Stanford I saw the demo part but just want to ask if it possible to use it to identify more entity types.
So currently stanford NER system (as ...
27
votes
6
answers
27k
views
Extract list of Persons and Organizations using Stanford NER Tagger in NLTK
I am trying to extract list of persons and organizations using Stanford Named Entity Recognizer (NER) in Python NLTK.
When I run:
from nltk.tag.stanford import NERTagger
st = NERTagger('/usr/share/...
26
votes
3
answers
10k
views
Maven fails to download CoreNLP models
When building the sample application from the Stanford CoreNLP website, I ran into a curious exception:
Exception in thread "main" java.lang.RuntimeException: edu.stanford.nlp.io.RuntimeIOException: ...
26
votes
2
answers
14k
views
What's the major difference between glove and word2vec?
What is the difference between word2vec and glove?
Are both the ways to train a word embedding? if yes then how can we use both?
24
votes
2
answers
10k
views
What is "unk" in the pretrained GloVe vector files (e.g. glove.6B.50d.txt)?
I found "unk" token in the glove vector file glove.6B.50d.txt downloaded from https://nlp.stanford.edu/projects/glove/. Its value is as follows:
unk -0.79149 0.86617 0.11998 0.00092287 0.2776 -0....
24
votes
1
answer
1k
views
pronoun resolution backwards
The usual coreference resolution works in the following way:
Provided
The man likes math. He really does.
it figures out that
he
refers to
the man.
There are plenty of tools to do this.
...
23
votes
3
answers
17k
views
Training n-gram NER with Stanford NLP
Recently I have been trying to train n-gram entities with Stanford Core NLP. I have followed the following tutorials - http://nlp.stanford.edu/software/crf-faq.shtml#b
With this, I am able to specify ...
20
votes
2
answers
11k
views
Stanford Parser tags
I just started using Stanford Parser but I do not understand the tags very well. This might be a stupid question to ask but can anyone tell me what does the SBARQ and SQ tags represent and where can I ...
19
votes
8
answers
10k
views
Multi-term named entities in Stanford Named Entity Recognizer
I'm using the Stanford Named Entity Recognizer http://nlp.stanford.edu/software/CRF-NER.shtml and it's working fine. This is
List<List<CoreLabel>> out = classifier.classify(text);
...
19
votes
1
answer
9k
views
EM score in SQuAD Challenge
The SQuAD Challenge ranks the results against the F1 and EM scores. There is a lot of information about the F1 score (a function of precision and recall). But what would the EM score be?
19
votes
2
answers
16k
views
what is distant supervision?
According to my understanding, Distant Supervision is the process of specifying the concept which the individual words of a passage, usually a sentence, are trying to convey.
For example, a database ...
18
votes
4
answers
11k
views
Tools for text simplification (Java) [closed]
What is the best tool that can do text simplification using Java?
Here is an example of text simplification:
John, who was the CEO of a company, played golf.
↓
John played ...
18
votes
1
answer
10k
views
What does the tag SBAR mean in Stanford’s parse-tree representation?
When the Online Stanford Parser tool is fed this original sentence:
After she ate the cake, Emma visited Tony in his room.
It produces the following parse-tree representation as its output:
(...
18
votes
1
answer
21k
views
stanford core nlp java output
I'm a newbie with Java and Stanford NLP toolkit and trying to use them for a project. Specifically, I'm trying to use Stanford Corenlp toolkit to annotate a text (with Netbeans and not command line) ...
17
votes
3
answers
9k
views
Accuracy: ANNIE vs Stanford NLP vs OpenNLP with UIMA
My work is planning on using a UIMA cluster to run documents through to extract named entities and what not. As I understand it, UIMA have very few NLP components packaged with it. I've been testing ...
17
votes
4
answers
13k
views
how to get a dependency tree with Stanford NLP parser
How can I get the dependency tree as Figure below. I can get the dependency relation as pure text, and also the dependency graph with the help of dependencysee tool. But how about the dependency tree ...
16
votes
1
answer
11k
views
Clause Extraction using Stanford parser
I have a complex sentence and I need to separate it into main and dependent clause.
For example for the sentence
ABC cites the fact that chemical additives are banned in many countries and feels they ...
16
votes
4
answers
25k
views
How can I extract address from raw text using NLTK in python?
I have this text
'''Hi, Mr. Sam D. Richards lives here, 44 West 22nd Street, New
York, NY 12345. Can you contact him now? If you need any help, call
me on 12345678'''
. How the address part ...
16
votes
3
answers
11k
views
Stanford Core NLP - understanding coreference resolution
I'm having some trouble understanding the changes made to the coref resolver in the last version of the Stanford NLP tools.
As an example, below is a sentence and the corresponding ...
16
votes
1
answer
8k
views
Is it possible to freeze only certain embedding weights in the embedding layer in pytorch?
When using GloVe embedding in NLP tasks, some words from the dataset might not exist in GloVe. Therefore, we instantiate random weights for these unknown words.
Would it be possible to freeze weights ...
16
votes
4
answers
4k
views
Sentence compression using NLP [closed]
Using Machine translation, can I obtain a very compressed version of a sentence,
eg. I would really like to have a delicious tasty cup of coffee would be translated to I want coffee
Does any of the ...
15
votes
3
answers
4k
views
How do I manipulate parse trees?
I've been playing around with natural language parse trees and manipulating them in various ways. I've been using Stanford's Tregex and Tsurgeon tools but the code is a mess and doesn't fit in well ...
15
votes
3
answers
13k
views
OpenNLP vs Stanford CoreNLP
I've been doing a little comparison of these two packages and am not sure which direction to go in. What I am looking for briefly is:
Named Entity Recognition (people, places, organizations and such)....
15
votes
4
answers
8k
views
how do I create my own training corpus for stanford tagger?
I have to analyze informal english text with lots of short hands and local lingo. Hence I was thinking of creating the model for the stanford tagger.
How do i create my own set of labelled corpus for ...
15
votes
3
answers
15k
views
How to train the Stanford NLP Sentiment Analysis tool
Hell everyone! I'm using the Stanford Core NLP package and my goal is to perform sentiment analysis on a live-stream of tweets.
Using the sentiment analysis tool as is returns a very poor analysis ...
15
votes
1
answer
382
views
'Can't return head of null or leaf Tree' with CoreNLP on Android
I want to use CoreNLP in my Android project. But when I create a CoreNLP instance like this:
import java.util.Properties;
import edu.stanford.nlp.ling.CoreAnnotations;
import edu.stanford.nlp.neural....
15
votes
2
answers
4k
views
Extracting multi word named entities using CoreNLP
I'm using CoreNLP for named entity extraction and have run into a bit of an issue.
The issue is that whenever a named entity is composed of more than one token, such as "Han Solo", the annotator does ...
14
votes
3
answers
11k
views
Difference between feature selection, feature extraction, feature weights
I am slightly confused as to what "feature selection / extractor / weights" mean and the difference between them. As I read the literature sometimes I feel lost as I find the term used quite loosely, ...
14
votes
2
answers
24k
views
NLTK was unable to find stanford-postagger.jar! Set the CLASSPATH environment variable
I am working on a project that requires me to tag tokens using nltk and python. So I wanted to use this. But came up with a few problems.
I went through a lot of other already asked questions and ...
14
votes
1
answer
9k
views
How do I use python interface of Stanford NER(named entity recogniser)?
I want to use Stanford NER in python using pyner library. Here is one basic code snippet.
import ner
tagger = ner.HttpNER(host='localhost', port=80)
tagger.get_entities("University of California is ...
14
votes
1
answer
4k
views
Maven dependency:get does not download Stanford NLP model files
The core component of the Stanford Natural Language Processing Toolkit has Java code in a stanford-corenlp-1.3.4.jar file, and has (very large) model files in a separate stanford-corenlp-1.3.4-models....
13
votes
1
answer
23k
views
Scala get file path of file in resources folder
I am using the Stanford CRFClassifier and in order to run, it requires a file that is the trained classifier model. I have put this file in the resources directory. From the Javadocs for the ...
13
votes
3
answers
18k
views
Extracting noun phrases from a text file using stanford typed parser
I have a text which I want to extract the noun phrases from it. I can easily get the typed parser for the text that i have, but wondering how i can extract the noun phrases in the text ?
13
votes
5
answers
8k
views
Extracting the relationship between entities in Stanford CoreNLP
I want to extract the complete relationship between two entities using Stanford CoreNLP (or maybe other tools).
For example:
Windows is more popular than Linux.
This tool requires Java.
...
13
votes
3
answers
11k
views
How to extract the noun phrases using Open nlp's chunking parser
I am newbie to Natural Language processing.I need to extract the noun phrases from the text.So far i have used open nlp's chunking parser for parsing my text to get the Tree structure.But i am not ...
13
votes
2
answers
1k
views
Converting tokens to word vectors effectively with TensorFlow Transform
I would like to use TensorFlow Transform to convert tokens to word vectors during my training, validation and inference phase.
I followed this StackOverflow post and implemented the initial ...
12
votes
3
answers
8k
views
Need a good relation extractor
I'm doing a NLP project.
The purpose of the project is to extract possible relationship between two things. For example, for a pair "location" and "person" the extracted results would be "near", "...
12
votes
4
answers
18k
views
Coreference resolution in python nltk using Stanford coreNLP
Stanford CoreNLP provides coreference resolution as mentioned here, also this thread, this, provides some insights about its implementation in Java.
However, I am using python and NLTK and I am not ...
12
votes
2
answers
6k
views
How to improve speed with Stanford NLP Tagger and NLTK
Is there any way to use the Standford Tagger in a more performant fashion?
Each call to NLTK's wrapper starts a new java instance per analyzed string which is very very slow especially when a larger ...