INDEX
Explanations
references to historical contexts and figures associated with ancient Roman culture
New Auto-Interp
Negative Logits
ſelves
-0.75
ſelf
-0.73
myſelf
-0.72
Jefus
-0.71
unſ
-0.69
purpoſe
-0.69
Majefty
-0.69
Eſ
-0.68
Diſ
-0.68
Reſ
-0.67
POSITIVE LOGITS
romanos
0.51
civilización
0.44
römischen
0.43
desnuda
0.42
Danach
0.41
ladiator
0.41
griego
0.40
mengenal
0.39
pasaba
0.37
detalle
0.37
Activations Density 0.251%