INDEX
Explanations
quantitative comparisons or approximations
phrases that indicate approximate quantities or statistics
Negative Logits
Token        Logit
Characters   -0.72
Sent         -0.69
ose          -0.68
actions      -0.66
deed         -0.65
POV          -0.64
deeds        -0.63
bearer       -0.63
aders        -0.62
ocy          -0.62
Positive Logits
Token        Logit
560          0.87
580          0.86
tripled      0.85
twice        0.83
380          0.83
doubled      0.81
370          0.80
260          0.80
670          0.79
450          0.78
Activations Density: 0.123%