INDEX
Explanations
phrases indicating an extreme level of intensity or effectiveness
New Auto-Interp
Negative Logits
elsen
-0.70
former
-0.68
eto
-0.67
ajo
-0.66
eners
-0.66
Destination
-0.66
Frames
-0.66
igi
-0.66
isms
-0.65
Runner
-0.64
POSITIVE LOGITS
difficult
1.03
rare
0.97
important
0.94
valuable
0.94
versatile
0.93
unlikely
0.93
expensive
0.92
beneficial
0.91
efficient
0.88
useful
0.88
Activations Density 0.037%