INDEX
Explanations
references to prominence and recognition in public figures or events
New Auto-Interp
Negative Logits
persons
-0.44
UrlResolution
-0.42
rators
-0.41
-0.38
idiots
-0.38
ویکیپدیای
-0.38
IVEREF
-0.37
persons
-0.37
nôtre
-0.37
ieuses
-0.37
POSITIVE LOGITS
former
0.99
ehemalige
0.69
veteran
0.68
former
0.68
native
0.67
Englishman
0.67
ex
0.65
charismatic
0.62
Irishman
0.60
erst
0.60
Activations Density 0.186%