INDEX
Explanations
points in time or instances of events
mentions of specific moments or instances in time
New Auto-Interp
Negative Logits
ords
-0.67
yet
-0.65
uno
-0.64
atts
-0.63
GES
-0.63
ÃŁ
-0.60
Frames
-0.60
advertisement
-0.58
rats
-0.58
galitarian
-0.58
POSITIVE LOGITS
during
1.16
throughout
0.89
ago
0.87
thereafter
0.82
apiece
0.77
along
0.76
during
0.74
midway
0.73
glance
0.71
arf
0.70
Activations Density 0.059%