INDEX
Explanations
numeric values
numerical values or identifiers
Negative Logits
Luther      -0.77
tis         -0.76
Flor        -0.75
Ariel       -0.73
Venezuel    -0.70
ε           -0.69
Pengu       -0.66
ģĸ          -0.66
riel        -0.65
Alvin       -0.64
Positive Logits
escape      0.71
ictive      0.69
iculty      0.69
hire        0.67
acia        0.67
erent       0.66
cially      0.65
ournament   0.65
ADA         0.64
enced       0.64
Activations Density 0.000%