INDEX
Explanations
dates or time-related phrases
New Auto-Interp
Negative Logits
pta
-0.76
BILITIES
-0.72
abus
-0.69
amina
-0.69
anie
-0.68
aves
-0.65
vantage
-0.65
bear
-0.64
obic
-0.63
immer
-0.62
POSITIVE LOGITS
rely
1.37
inception
1.13
1945
0.91
1999
0.91
1979
0.91
1998
0.90
2006
0.90
2009
0.90
2005
0.89
1995
0.88
Activations Density 0.848%