INDEX
Explanations
proper nouns or uncommon/gibberish words in a specific format
terms related to utopian and dystopian concepts
New Auto-Interp
Negative Logits
thresholds
-0.80
signatures
-0.70
warnings
-0.70
microphones
-0.69
markings
-0.68
caveats
-0.67
votes
-0.64
stickers
-0.62
nods
-0.62
respondents
-0.61
POSITIVE LOGITS
iphate
0.79
capitalist
0.78
organism
0.77
ocracy
0.76
institution
0.76
ifestyle
0.75
orld
0.74
cosystem
0.71
tarian
0.71
lite
0.70
Activations Density 0.670%