INDEX
Explanations
acronyms related to organizations or institutions
references to organizations or entities, particularly those relating to humane societies or health services
New Auto-Interp
Negative Logits
dylib
-0.75
fman
-0.75
Samoa
-0.74
verages
-0.73
bidden
-0.71
tail
-0.66
oshenko
-0.66
stru
-0.66
taboola
-0.65
ezvous
-0.65
POSITIVE LOGITS
ocial
1.11
HS
1.03
IFT
0.93
Ds
0.92
ospital
0.89
INESS
0.85
BC
0.84
HT
0.84
alez
0.82
EMA
0.82
Activations Density 0.012%