INDEX
Explanations
specific dietary patterns and their classifications
New Auto-Interp
Negative Logits
istisches
-0.60
rather
-0.49
.
-0.47
and
-0.46
but
-0.45
Bet
-0.44
mis
-0.44
/
-0.43
så
-0.42
side
-0.42
POSITIVE LOGITS
reaſon
0.94
themſelves
0.92
ſelf
0.91
Efq
0.91
ſtand
0.91
Theſe
0.90
itſelf
0.89
AssemblyCompany
0.88
myſelf
0.87
whoſe
0.87
Activations Density 0.081%