INDEX
Explanations
references to the word "fat" in various contexts
New Auto-Interp
Negative Logits
aug
-0.17
ByExample
-0.16
istrovstvÃŃ
-0.16
thon
-0.16
gaard
-0.16
eer
-0.16
ViewState
-0.15
edes
-0.15
Sesso
-0.15
itzer
-0.15
POSITIVE LOGITS
igue
0.33
ima
0.30
uous
0.29
ality
0.27
ig
0.27
uously
0.24
ernity
0.23
igure
0.22
uity
0.22
igate
0.21
Activations Density 0.006%