INDEX
Explanations
names of individuals
proper nouns, particularly names of individuals
Negative Logits
ModLoader        -0.80
Confederation    -0.80
culosis          -0.79
ashtra           -0.69
âĶĢâĶĢ           -0.69
Dhabi            -0.67
terday           -0.67
renheit          -0.66
>>\              -0.65
duino            -0.65
Positive Logits
isner            0.83
oner             0.83
aney             0.82
antz             0.81
utsch            0.80
agan             0.78
agg              0.78
onson            0.78
iggs             0.74
iso              0.74
Activations Density: 0.198%