INDEX
Explanations
phrases referring to different perspectives or opinions
references to perspectives or viewpoints
New Auto-Interp
Negative Logits
sshd
-0.70
é¾
-0.70
apons
-0.67
anon
-0.66
20439
-0.62
nom
-0.62
sbm
-0.61
Brave
-0.61
Cipher
-0.61
bid
-0.60
POSITIVE LOGITS
tains
0.76
differentiation
0.74
view
0.71
contention
0.69
reference
0.68
oland
0.67
overlap
0.66
divergence
0.65
Exile
0.64
reference
0.64
Activations Density 0.049%