INDEX
Explanations
verbs with the 'ing' suffix that involve some sort of action or process
terms and phrases related to assertion and prediction
Negative Logits
avorite      -0.76
Solitaire    -0.67
impulse      -0.65
felon        -0.65
RAW          -0.64
Forever      -0.64
bye          -0.63
curfew       -0.63
uncontroll   -0.63
americ       -0.62
Positive Logits
aled         1.06
ighed        1.05
ving         1.02
istered      1.02
ues          1.01
oused        1.00
uing         0.99
arding       0.99
ued          0.98
aling        0.97
Activations Density 0.235%