INDEX
Explanations
phrases related to negative judgments on health and fitness
references to negative health conditions and characteristics
Negative Logits
soType          -0.95
glas            -0.84
lance           -0.81
ership          -0.76
sheets          -0.75
DragonMagazine  -0.74
itivity         -0.74
ibel            -0.74
anmar           -0.73
inical          -0.72
Positive Logits
unnatural       0.80
wastes          0.76
costly          0.73
unreasonable    0.72
aber            0.71
contradictory   0.68
unnecessary     0.67
unreliable      0.67
bog             0.67
inefficient     0.66
Activations Density: 0.027%