INDEX
Explanations
medical expertise and references to healthcare professionals
Follows "to" or certain punctuation marks
older English and French influence
Negative Logits
pretty        -0.74
clueless      -0.69
supposedly    -0.68
              -0.64
actually      -0.63
messed        -0.63
weirdly       -0.63
вроде         -0.63
tricky        -0.62
T             -0.62
Positive Logits
itſelf        1.02
myſelf        0.98
ainfi         0.95
Monfieur      0.94
་་            0.94
pleaſure      0.92
auffi         0.92
ſelves        0.91
enfans        0.89
purpoſe       0.89
Activations Density 0.258%