INDEX
Explanations
phrases related to impactful or important statements
sentences that contain strong emotional statements or reflections
New Auto-Interp
Negative Logits
respectively
-0.86
remaining
-0.79
wet
-0.77
allowance
-0.77
touring
-0.76
manageable
-0.74
overall
-0.70
separately
-0.70
minim
-0.70
rigorous
-0.70
POSITIVE LOGITS
Called
1.45
Something
1.16
Someone
1.12
Specifically
1.01
Suppose
0.98
Somebody
0.97
Referred
0.96
Apparently
0.95
Someone
0.95
Intent
0.93
Activations Density 0.619%