INDEX
Explanations
questions starting with "What makes" or "What distinguishes", and discussion of the differences or qualities that set things apart from others
questions that seek to understand the defining qualities or characteristics of various subjects
Negative Logits
lance      -0.86
erenn      -0.73
abad       -0.73
ixel       -0.71
EEE        -0.68
estern     -0.68
rentice    -0.67
bill       -0.67
pard       -0.66
lex        -0.66
Positive Logits
Sanct          0.76
distinguishes  0.71
pires          0.70
motiv          0.68
Fit            0.66
Cu             0.65
ISIL           0.64
Klu            0.64
orno           0.64
motivating     0.63
Activations Density 0.087%