INDEX
Explanations
references to the term "Fat"
repeated references to the word "Fat."
New Auto-Interp
Negative Logits
uden
-0.90
incoln
-0.72
cov
-0.71
PLIED
-0.70
espie
-0.69
eele
-0.69
oresc
-0.69
endale
-0.67
roman
-0.66
imbabwe
-0.66
POSITIVE LOGITS
Fat
3.91
Fat
3.08
fat
2.13
FAT
2.04
fat
1.90
fatty
1.43
fats
1.27
Abbas
1.23
Thin
1.17
Sad
1.14
Activations Density 0.014%