INDEX
Explanations
names or words ending in 'ling'
forms of the word "ling."
New Auto-Interp
Negative Logits
ailable
-0.71
raviolet
-0.70
ĻĤ
-0.70
schild
-0.69
raints
-0.68
Seym
-0.67
ORTS
-0.67
isites
-0.64
emis
-0.64
acco
-0.64
POSITIVE LOGITS
gren
1.00
gling
1.00
phrine
0.88
tons
0.87
ttes
0.84
ham
0.81
worth
0.80
berg
0.79
hammer
0.79
ishly
0.78
Activations Density 0.031%