INDEX
Explanations
proper nouns or names of entities
phrases indicating a group or a collective entity
New Auto-Interp
Negative Logits
anytime
-0.69
ario
-0.65
rification
-0.61
ober
-0.60
translates
-0.60
ysc
-0.59
culosis
-0.58
olation
-0.58
introduction
-0.58
commute
-0.57
POSITIVE LOGITS
those
0.81
ĪĴ
0.73
IJ
0.72
casualties
0.71
those
0.71
whom
0.66
Sources
0.66
addons
0.66
innumerable
0.66
stad
0.64
Activations Density 0.028%