Explanations
phrases indicating an extreme or critical situation
references to extreme levels or thresholds of various situations or emotions
Negative Logits
Token     Logit
DRAG      -0.71
Ging      -0.64
Deal      -0.58
"$:/      -0.57
Spend     -0.55
Drag      -0.53
filler    -0.53
Surviv    -0.53
Span      -0.53
Grant     -0.52
Positive Logits
Token          Logit
extent         1.07
detriment      0.93
fullest        0.76
venge          0.72
tune           0.72
approximation  0.72
podium         0.71
lengths        0.70
brink          0.70
extremes       0.69
Activations Density: 0.240%