INDEX
Explanations
phrases related to time or temporal comparisons
New Auto-Interp
Negative Logits
mates
-0.79
McGee
-0.79
mate
-0.76
ohyd
-0.75
reditary
-0.75
salads
-0.75
uay
-0.74
ttes
-0.74
Leilan
-0.73
arthed
-0.73
POSITIVE LOGITS
vein
0.99
level
0.79
rity
0.77
denomination
0.72
IFF
0.72
shaky
0.71
token
0.70
expense
0.68
exact
0.68
rate
0.67
Activations Density 3.882%