INDEX
Explanations
phrases related to personal beliefs and political opinions
New Auto-Interp
Negative Logits
phase
-0.69
defined
-0.64
Aren
-0.63
cation
-0.63
UNCH
-0.62
noon
-0.62
sequence
-0.62
olkien
-0.60
ftime
-0.59
trak
-0.59
POSITIVE LOGITS
own
1.61
arms
1.05
fingers
1.04
fists
1.04
selves
1.01
sights
1.01
hands
0.99
belongings
0.99
entire
0.99
fingerprints
0.96
Activations Density 0.125%