INDEX
Explanations
phrases contrasting opposing views or arguments
Negative Logits
iard        -0.83
ranged      -0.75
tesy        -0.73
            -0.72
dropping    -0.70
saw         -0.70
noon        -0.68
ahead       -0.68
rough       -0.68
MpServer    -0.67
Positive Logits
beliefs         1.01
reality         0.99
expectations    0.95
belief          0.94
tenets          0.91
orthodoxy       0.89
prevailing      0.88
worldview       0.88
dogma           0.87
ideals          0.87
Activations Density 0.053%