INDEX
Explanations
references to quotes or statements made by different individuals
expressions of surprise or strong emotions
New Auto-Interp
Negative Logits
enment
-0.57
}.
-0.54
ankind
-0.53
inis
-0.51
illion
-0.50
.'
-0.49
.$
-0.48
iky
-0.48
)--
-0.48
vertisement
-0.47
POSITIVE LOGITS
"â̦
0.65
"
0.61
"'
0.61
"[
0.57
"...
0.57
"#
0.53
misunderstood
0.52
underestimated
0.51
Blumenthal
0.51
"))
0.51
Activations Density 2.480%