INDEX
Explanations
proper nouns
names of individuals, particularly emphasizing a character named Morales
New Auto-Interp
Negative Logits
sled
-0.73
BT
-0.72
peac
-0.71
Kitty
-0.70
Nato
-0.69
Britain
-0.68
ilk
-0.68
autumn
-0.67
drawn
-0.66
âĹ¼
-0.65
POSITIVE LOGITS
Morales
2.61
Guerrero
2.59
Ramirez
2.41
Gonzalez
2.38
Herrera
2.37
Ramos
2.35
Sanchez
2.32
Ortiz
2.31
Reyes
2.30
Hernandez
2.30
Activations Density 0.069%