INDEX
Explanations
phrases related to controversy or disagreement
references to varying experiences and opinions among different groups of people
Negative Logits
Sphere              -0.74
Faul                -0.73
Dug                 -0.70
enegger             -0.68
issance             -0.65
................    -0.64
DRAGON              -0.64
mie                 -0.62
Gene                -0.60
========            -0.60
Positive Logits
outright            0.82
subconscious        0.80
unconsciously       0.73
depending           0.72
downright           0.72
unwanted            0.68
specialize          0.67
arser               0.66
unnoticed           0.66
rants               0.66
Activations Density 0.847%