INDEX
Explanations
phrases indicating examples or instances
occurrences of the word "being" and related forms
Negative Logits
PsyNetMessage   -0.75
Dag             -0.71
Beginning       -0.63
robat           -0.62
inav            -0.62
sovere          -0.62
Lag             -0.60
odor            -0.59
Album           -0.57
Surv            -0.57
Positive Logits
able            1.10
eaten           0.91
discussed       0.89
hailed          0.88
replaced        0.88
disposed        0.88
consumed        0.88
held            0.87
chased          0.87
phased          0.86
Activations Density: 0.057%