INDEX
Explanations
years or time periods in phrases
references to specific years and financial figures
Negative Logits
desks       -0.77
scanner     -0.65
contr       -0.65
Free        -0.62
corners     -0.61
Sov         -0.61
TEXT        -0.60
Dialogue    -0.60
antit       -0.60
igraph      -0.59
Positive Logits
onwards     0.98
onward      0.75
etheus      0.73
govtrack    0.71
netflix     0.68
ndra        0.67
teness      0.64
Acceler     0.64
Cance       0.63
hua         0.63
Activations Density: 0.359%