INDEX
Explanations
proper nouns related to names of people or places, more specifically names containing 'Alf' or 'Alc'
proper nouns and specific food-related terms
New Auto-Interp
Negative Logits
ledged
-0.85
pload
-0.82
achu
-0.80
ling
-0.71
soDeliveryDate
-0.71
ly
-0.67
kie
-0.66
apore
-0.66
pter
-0.66
Beware
-0.66
POSITIVE LOGITS
onso
1.17
arial
0.87
ayne
0.85
retri
0.84
ibaba
0.81
inia
0.78
ivia
0.78
manac
0.77
asca
0.75
onde
0.74
Activations Density 0.019%