INDEX
Explanations
words related to physical health conditions, particularly those related to muscles and nerves
medical and scientific terminology, particularly related to physical health conditions and substances
Negative Logits
Token         Logit
appro         -0.67
judgment      -0.66
aston         -0.63
Germany       -0.62
ourced        -0.62
judgement     -0.61
NSA           -0.60
Pathfinder    -0.60
keeper        -0.60
Labour        -0.59
Positive Logits
Token         Logit
phy           1.29
asus          1.08
xit           0.99
xia           0.98
ptoms         0.94
onies         0.93
lly           0.93
lla           0.92
ertodd        0.91
nces          0.90
Activations Density 0.005%
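
For context, a density figure like the one above is commonly read as the fraction of sampled tokens on which the feature fires at all. The sketch below illustrates that reading only; the function name `activation_density`, the zero threshold, and the sample data are assumptions made for illustration, not taken from this page or from any particular tool.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means (tokens where the feature fires) / (all tokens).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 1 token out of 20,000.
acts = [0.0] * 19_999 + [3.2]
density = activation_density(acts)
print(f"{density:.3%}")
```

Under this reading, a density of 0.005% corresponds to roughly one active token in every 20,000 sampled.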