INDEX
Explanations
mentions of a specific person or entity named "Av"
references to a specific entity or person named "Av."
Negative Logits
utenant       -0.77
hyde          -0.70
inaccessible  -0.66
zsche         -0.63
FORMATION     -0.63
ciplinary     -0.61
bered         -0.61
Barnett       -0.59
towels        -0.59
cooker        -0.59
Positive Logits
atars         1.12
ril           1.04
ocado         1.03
iator         0.99
iol           0.98
raham         0.98
iew           0.98
iva           0.95
atar          0.94
ille          0.94
Activations Density: 0.017%