INDEX
Explanations
references to specific fictional or historical characters, locations, and concepts
New Auto-Interp
Negative Logits
Anthem
-0.72
Bleach
-0.69
Kenobi
-0.68
wagen
-0.66
rans
-0.64
Nirvana
-0.61
Obj
-0.60
backs
-0.59
internationally
-0.58
juggling
-0.57
POSITIVE LOGITS
ASY
1.18
nerg
1.14
fficient
1.13
ighty
1.11
isner
1.08
TERN
1.07
cosystem
1.07
AST
1.07
tymology
1.07
YE
1.06
Activations Density 0.021%