Explanations
terms related to superiority or high quality
instances of the word "super."
Negative Logits
Token               Logit
76561               -0.78
Seym                -0.74
edin                -0.67
endor               -0.61
EStream             -0.61
hed                 -0.61
gow                 -0.60
edIn                -0.60
ylon                -0.60
externalActionCode  -0.60
Positive Logits
Token               Logit
intendent           1.27
visor               1.22
ior                 1.14
visors              1.03
vised               1.02
visory              0.91
marine              0.88
natural             0.87
conduct             0.86
super               0.85
Activations Density: 0.005%