INDEX
Explanations
time-related terms and actions
numeric values and related temporal phrases
Negative Logits
SY          -0.85
Tele        -0.76
Hyp         -0.72
Ult         -0.72
Bul         -0.72
Tor         -0.71
SY          -0.71
Hyd         -0.71
Rocket      -0.70
Obs         -0.70
Positive Logits
locality    0.75
avenue      0.72
abase       0.72
avement     0.69
era         0.69
occupation  0.68
zone        0.67
cavity      0.66
iversary    0.65
bered       0.65
Activations Density 0.616%