INDEX
Explanations
phrases related to past states of being or situations that have changed over time
phrases indicating a transformation or transition from one state to another
Negative Logits
Retrieved     -0.82
2018          -0.74
wake          -0.70
Cosponsors    -0.69
update        -0.69
2018          -0.68
Extend        -0.68
Recall        -0.67
"]=>          -0.66
Whereas       -0.66
Positive Logits
unthinkable   1.24
taboo         1.04
unimaginable  1.02
innocuous     1.00
harmless      0.97
dormant       0.96
regarded      0.92
thinkable     0.92
unheard       0.90
timid         0.87
Activations Density: 0.267%