INDEX
Explanations
phrases indicating a surprising or revealing discovery
instances of the phrase "it turns out."
New Auto-Interp
Negative Logits
è¦ļéĨĴ
-1.05
accompan
-0.80
rongh
-0.70
riot
-0.68
Expansion
-0.67
Repeat
-0.64
aign
-0.59
76561
-0.58
Reserved
-0.57
riots
-0.57
POSITIVE LOGITS
out
1.08
entious
0.77
orned
0.66
inward
0.65
out
0.65
outs
0.63
hift
0.62
enum
0.60
forth
0.60
doubtful
0.60
Activations Density 0.017%