INDEX
Explanations
references to emotional experiences and insecurity
New Auto-Interp
Negative Logits
ãĤ¢ãĥĭãĥ¡
-0.15
δα
-0.14
Albums
-0.14
oca
-0.14
Concert
-0.14
_SF
-0.14
agnost
-0.14
cartoons
-0.14
Phong
-0.14
ocker
-0.13
POSITIVE LOGITS
shooting
0.57
shoot
0.55
shoots
0.50
Shooting
0.45
shot
0.44
shoot
0.42
filming
0.40
shootings
0.39
Shoot
0.39
shots
0.38
Activations Density 0.416%