INDEX
Explanations
the word "af"
words related to deafness or hearing impairment
Negative Logits
imens          -0.81
tremend        -0.76
generalized    -0.72
besie          -0.71
arcer          -0.71
accomp         -0.68
uters          -0.68
unification    -0.66
psychiat       -0.66
ournal         -0.66
Positive Logits
bread          0.97
cakes          0.93
cake           0.89
bones          0.87
hawk           0.86
horn           0.85
ings           0.84
lings          0.83
beard          0.82
cock           0.79
Activations Density 0.159%