INDEX
Explanations
future predictions or speculations by examining phrases that indicate extrapolation or anticipation
phrases related to fate and the passage of time
New Auto-Interp
Negative Logits
selage
-0.80
artney
-0.75
outheastern
-0.73
sidx
-0.70
inia
-0.69
unal
-0.69
ocial
-0.69
ebted
-0.68
iatric
-0.68
ospons
-0.67
POSITIVE LOGITS
dictates
0.99
dictate
0.95
abound
0.83
trump
0.82
intervened
0.76
Kinnikuman
0.74
trick
0.71
tricks
0.71
prevailed
0.71
favor
0.69
Activations Density 0.442%