INDEX
Explanations
adjectives and nouns related to abstract concepts and beliefs
terms and concepts related to philosophical and ideological discussions
Negative Logits
Token       Logit
KEN         -0.76
urion       -0.76
gov         -0.75
Corp        -0.75
tower       -0.73
lain        -0.72
bsite       -0.70
boarding    -0.70
etts        -0.69
ortium      -0.69
Positive Logits
Token           Logit
prowess         1.09
significance    1.06
superiority     1.02
realities       1.01
impossibility   0.99
differences     0.97
sophistication  0.96
ities           0.95
aspects         0.93
implications    0.91
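
The page does not say how the logit values above are produced. A common convention for dashboards of this kind, assumed here and not confirmed by the source, is to project the feature's output direction through the model's unembedding matrix and list the tokens with the most negative and most positive scores. The sketch below illustrates that idea in plain Python. The function names, the placeholder tokens, and the two-dimensional vectors are all hypothetical and are not taken from the feature shown here.

```python
def logit_effects(direction, unembed):
    """Dot each token's unembedding row with the feature direction.

    direction: list of floats (assumed feature output direction)
    unembed:   dict mapping token -> list of floats (assumed unembedding rows)
    """
    return {
        token: sum(w * d for w, d in zip(row, direction))
        for token, row in unembed.items()
    }


def top_and_bottom(effects, k):
    """Return (k most negative, k most positive) as lists of (token, score)."""
    ranked = sorted(effects.items(), key=lambda item: item[1])
    negative = ranked[:k]          # ascending: most negative first
    positive = ranked[::-1][:k]    # descending: most positive first
    return negative, positive


# Hypothetical toy data: four placeholder tokens in a 2-dimensional space.
direction = [1.0, -0.5]
unembed = {
    "tok_a": [1.0, 0.0],
    "tok_b": [0.5, 0.0],
    "tok_c": [-0.5, 0.5],
    "tok_d": [0.0, 1.0],
}

effects = logit_effects(direction, unembed)
negative, positive = top_and_bottom(effects, 2)
print(negative)  # [('tok_c', -0.75), ('tok_d', -0.5)]
print(positive)  # [('tok_a', 1.0), ('tok_b', 0.5)]
```

Under this assumption, a positive value means the feature pushes the model toward predicting that token when it is active, and a negative value means it pushes away from it.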
Activations Density: 0.141%
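
Activation density is usually read as the fraction of sampled tokens on which the feature fires at all. The sketch below shows that calculation under the assumption that "fires" means an activation strictly greater than zero; the source does not state the threshold or the sample size. The function name and the sample of 100,000 tokens are hypothetical, chosen only so that the result matches the 0.141% figure.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds the threshold."""
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 141 active tokens out of 100,000.
acts = [0.0] * 99_859 + [1.5] * 141
density = activation_density(acts)
print(f"{density:.3%}")  # 0.141%
```

A density this low means the feature is sparse: it is silent on the vast majority of tokens and active on roughly one token in 700.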