INDEX
Explanations
connectors that contrast ideas
ideas that challenge conventional beliefs about individualism and success
New Auto-Interp
Negative Logits
interstitial
-0.81
åĮ
-0.70
Eva
-0.69
awoken
-0.66
ãģ®ç
-0.66
inarily
-0.66
+++
-0.64
cellaneous
-0.63
)))
-0.63
00007
-0.62
POSITIVE LOGITS
nor
1.02
necessarily
1.01
anymore
0.96
\":
0.70
teness
0.70
malice
0.67
ASY
0.64
merits
0.63
any
0.63
anybody
0.62
Activations Density 0.583%