INDEX
Explanations
adjectives and phrases indicating a sense of extremity or surpassing limits
references to limits or constraints beyond one's control
New Auto-Interp
Negative Logits
differently
-0.87
ahead
-0.73
behind
-0.72
DAY
-0.69
alike
-0.67
soType
-0.66
followed
-0.64
dylib
-0.63
instead
-0.61
md
-0.61
POSITIVE LOGITS
bounds
1.28
confines
1.13
boundaries
1.06
horizon
1.03
comprehension
0.97
threshold
0.96
limits
0.96
borders
0.94
fray
0.85
scope
0.85
Activations Density 0.158%