INDEX
Explanations
years, specifically the year 2008
references to the year 2008
Negative Logits
onite      -0.71
hma        -0.68
ancest     -0.67
unin       -0.66
unal       -0.65
Magikarp   -0.65
orate      -0.65
friends    -0.65
rails      -0.64
lying      -0.64
Positive Logits
å¹           0.97
-'           0.81
é¾           0.80
2008         0.74
2008         0.73
aeda         0.72
2007         0.71
worthiness   0.67
2020         0.65
ilton        0.65
Activations Density 0.017%