Explanations
questions mentioned in news or interviews
the phrase "when asked," indicating inquiries or questions posed to individuals
Negative Logits
Token      Logit
mit        -0.68
Es         -0.66
ãĥĩãĤ£     -0.65
ports      -0.64
marine     -0.64
xon        -0.63
burgh      -0.62
76561      -0.61
tten       -0.61
tiny       -0.59
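
The token ãĥĩãĤ£ in the list above looks like raw bytes shown through a byte-level BPE alphabet rather than corrupted text. The source does not name the model or tokenizer, so the following is only a minimal sketch under the assumption that the tokenizer uses the GPT-2-style byte-to-character table; the function names bytes_to_unicode and decode_token are illustrative and not part of the dashboard.

```python
def bytes_to_unicode():
    """Byte -> printable-character table used by GPT-2-style byte-level BPE.

    Assumption: the dashboard's tokenizer follows this scheme.
    """
    # Bytes that are already printable map to themselves.
    bs = (list(range(ord("!"), ord("~") + 1))
          + list(range(ord("¡"), ord("¬") + 1))
          + list(range(ord("®"), ord("ÿ") + 1)))
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point starting at 256.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Invert the table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Map a displayed token back to bytes, then decode those bytes as UTF-8."""
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


print(decode_token("ãĥĩãĤ£"))  # prints ディ
```

Under that assumption the token is the Japanese katakana pair ディ. Plain ASCII tokens such as asked pass through unchanged, and a leading Ġ would decode to a space.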
Positive Logits
Token       Logit
quizz       1.00
rhet        0.90
posed       0.83
ioned       0.79
asked       0.78
probing     0.73
questions   0.70
questioned  0.67
confronted  0.66
erville     0.66
Activations Density 0.026%