INDEX
Explanations
phrases related to utopian or dystopian visions
terms related to utopia and dystopia
New Auto-Interp
Negative Logits
backer
-0.65
hips
-0.65
Drawn
-0.64
Radar
-0.64
graded
-0.64
Bridge
-0.63
Gladiator
-0.62
Cobra
-0.62
rons
-0.62
Analyst
-0.60
POSITIVE LOGITS
ut
4.35
utf
1.37
Ut
1.36
Ut
1.32
utopian
1.32
dystop
1.32
UTF
1.21
UT
1.09
UTF
1.03
san
0.94
Activations Density 0.012%