Skip to content

Commit a76a854

Browse files
committed
Find fronted 'said' ccomps, with a few negative detections to avoid other likely tree structures which are similar but aren't actually fronted ccomp - see comments
1 parent c5ba427 commit a76a854

File tree

1 file changed

+8
-1
lines changed

1 file changed

+8
-1
lines changed

src/edu/stanford/nlp/trees/UniversalEnglishGrammaticalRelations.java

+8-1
Original file line numberDiff line numberDiff line change
@@ -575,7 +575,14 @@ private UniversalEnglishGrammaticalRelations() {}
575575
"@NP < JJ|NN|NNS < (SBAR=target [ !<(S < (VP < TO )) | !$-- NP|NN|NNP|NNS ] )",
576576
// New ones to pick up some more "say" patterns (2019); avoid S-ADV descendants
577577
"VP < (/^V/ < " + sayVerbRegex + ") < (S|S-CLF|S-TTL|SQ=target <+(S) (VP < /^VB[DZP]$/))",
578-
"@S < /^S-TPC/=target < VP"
578+
"@S < /^S-TPC/=target < VP",
579+
// detect fronted VPs, eg
580+
// "not finding this ccomp is bad, he said"
581+
// eliminate VP !< SBAR to avoid detecting
582+
// "he was debugging and (VP saying (SBAR he wanted to find the ccomp))"
583+
// eliminate S !< (VP < (/^VB[GN]/ !$-- /^V/)) to avoid detecting
584+
// (S (NP Rick Lynch) (S (VP (VBG referring to ...))) (VP says ...))
585+
"S < (S=target $++ (VP < (/^V/ < " + sayVerbRegex + ") !< SBAR) !< (VP < (/^VB[GN]/ !$-- /^V/)))"
579586
);
580587

581588

0 commit comments

Comments
 (0)