INDEX
Explanations
mentions of the term "Roman"
references to Roman history and culture
New Auto-Interp
Negative Logits
intosh
-1.11
mble
-0.96
ramid
-0.85
NetMessage
-0.85
*/(
-0.83
olulu
-0.83
ilater
-0.80
razil
-0.80
lessly
-0.79
lesh
-0.78
POSITIVE LOGITS
Catholic
1.01
Catholicism
0.86
Reign
0.85
Torch
0.84
numer
0.82
Catholics
0.79
Inquisition
0.78
Roman
0.77
Pont
0.75
Emperor
0.75
Activations Density 0.012%