INDEX
Explanations
mentions of specific names and locations, such as names of people, places, and organizations
New Auto-Interp
Negative Logits
ratulations
-0.83
carbohyd
-0.77
dinand
-0.73
cffff
-0.73
rification
-0.72
uyomi
-0.72
geant
-0.71
awks
-0.71
nesota
-0.70
alties
-0.70
POSITIVE LOGITS
star
0.86
light
0.76
runner
0.73
fall
0.73
smith
0.69
more
0.68
board
0.67
isance
0.66
bringer
0.66
str
0.65
Activations Density 0.343%