INDEX
Explanations
phrases indicating conclusions or final thoughts
phrases that indicate conclusions or summarizations
New Auto-Interp
Negative Logits
activated
-0.83
cler
-0.73
otin
-0.72
pes
-0.72
drivers
-0.70
enta
-0.69
href
-0.67
opic
-0.65
nurs
-0.64
apt
-0.64
POSITIVE LOGITS
conclude
1.05
concluding
1.02
concludes
0.96
conclusion
0.92
Conclusion
0.81
reement
0.77
concluded
0.77
FANTASY
0.76
iary
0.75
conclusions
0.74
Activations Density 0.008%