INDEX
Explanations
words with the term "so-called."
phrases that include the term "so-called," indicating a skepticism or critique of labels or classifications
Negative Logits
=-=-=-=-          -0.90
=-=-=-=-=-=-=-=-  -0.82
erves             -0.78
unden             -0.78
=-=-              -0.76
istg              -0.74
destro            -0.69
*=-               -0.69
Ö¼                -0.68
proport           -0.67
Positive Logits
called    0.83
erred     0.82
untarily  0.77
holes     0.72
metadata  0.72
calling   0.72
amn       0.72
call      0.71
unt       0.70
nant      0.69
Activations Density 0.023%