INDEX
Explanations
proper nouns related to healthcare settings or professions
occurrences of the word "Care" and its variations
Negative Logits
ãĥĥãĥī    -0.79
perjury   -0.65
extrad    -0.62
awe       -0.62
integer   -0.62
RESULTS   -0.61
sql       -0.61
lap       -0.61
Chomsky   -0.59
poppy     -0.58
Positive Logits
taker     1.30
care      0.98
giving    0.98
er        0.97
fully     0.96
tta       0.87
ndra      0.86
lli       0.86
llan      0.85
taking    0.83
Activations Density 0.014%
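
The density figure above can be read as the share of token positions on which the feature is active. The sketch below illustrates that reading only; the function name `activation_density`, the zero threshold, and the sample data are assumptions for illustration, not taken from the dashboard or its tooling.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature fires;
    the dashboard's exact definition is not stated in the source.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 100,000 token positions, 14 of which activate the feature.
sample = [0.0] * 100_000
for i in range(14):
    sample[i * 7_000] = 1.5  # arbitrary nonzero activation value

density = activation_density(sample)
print(f"Activations Density {density:.3%}")
```

Under this reading, a density of 0.014% corresponds to roughly 14 active positions per 100,000 tokens, which marks the feature as very sparse.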