INDEX
Explanations
phrases related to legal matters or actions
instances of the word "def" in various contexts, often related to definitions or derogatory terms
New Auto-Interp
Negative Logits
externalActionCode
-0.69
Archdemon
-0.68
WER
-0.65
åŃ
-0.63
ONES
-0.63
ppo
-0.62
Galile
-0.61
Franks
-0.60
sth
-0.60
DragonMagazine
-0.59
POSITIVE LOGITS
ibr
1.05
ensible
1.03
acements
0.95
acement
0.93
ector
0.92
inished
0.91
def
0.89
luent
0.87
def
0.86
initions
0.86
Activations Density 0.007%