INDEX
Explanations
negations or obstacles
expressions related to obstacles or difficulties
Negative Logits
fundamentally    -0.78
radically        -0.66
matically        -0.66
Timeline         -0.66
ocracy           -0.65
historically     -0.62
inherently       -0.61
ually            -0.61
Closed           -0.60
basically        -0.60
Positive Logits
inconvenience    1.11
consolation      1.07
avail            0.97
assistance       0.94
inconven         0.89
distraction      0.88
Advantage        0.85
welcome          0.85
annoyance        0.83
encouragement    0.82
Activations Density: 0.483%