INDEX
Explanations
mentions of dates or periods of time
definite and demonstrative articles, indicating the presence or importance of specific entities
New Auto-Interp
Negative Logits
Allows
-0.70
/-
-0.68
occupations
-0.66
ItemThumbnailImage
-0.65
SPONSORED
-0.64
MAL
-0.64
MENT
-0.64
TION
-0.63
ILCS
-0.62
VERTISEMENT
-0.61
POSITIVE LOGITS
proverbial
0.92
finger
0.86
needles
0.83
veil
0.80
fingers
0.78
dice
0.78
hairs
0.77
entire
0.76
globe
0.76
whole
0.76
Activations Density 0.355%