INDEX
Explanations
phrases indicating a conditional statement
the word "that" in various contexts
New Auto-Interp
Negative Logits
rior
-0.71
Merit
-0.70
emis
-0.69
rypt
-0.67
greg
-0.66
asures
-0.65
izont
-0.64
Shar
-0.63
cycles
-0.63
IAS
-0.63
POSITIVE LOGITS
happens
1.02
happened
1.01
translates
0.95
pesky
0.92
occurs
0.90
mattered
0.90
entails
0.90
occurred
0.89
culminated
0.86
contradicts
0.84
Activations Density 0.177%