INDEX
Explanations
quotes or statements of different types
references to quotes and quotations
Negative Logits
raine       -0.69
icultural   -0.65
ixture      -0.65
olescent    -0.65
icable      -0.64
stay        -0.62
ickle       -0.59
roller      -0.59
ipation     -0.59
endant      -0.58
Positive Logits
quotes      4.23
quotations  2.96
quote       2.61
quotation   2.47
quoted      1.94
quoting     1.90
quote       1.64
Quote       1.63
Quotes      1.60
excerpts    1.50
Activations Density: 0.010%