Explanations
the word "the" followed by a specific word or phrase
frequent occurrences of the word "the"
Negative Logits
Chun            -0.77
Reich           -0.71
Kurdistan       -0.67
Chamber         -0.66
Mafia           -0.63
desperately     -0.63
Kuh             -0.62
consolidation   -0.62
Reconstruction  -0.61
Schwe           -0.61
Positive Logits
_               1.15
vern            0.98
bsite           0.98
toc             0.97
package         0.95
visible         0.94
bidden          0.94
next            0.94
anmar           0.93
mosp            0.91
Activations Density: 0.184%