INDEX
Explanations
references to governmental actions and authority
Negative Logits
########.        -0.83
acoper           -0.83
viață            -0.81
seamnă           -0.80
parsedMessage    -0.77
picioare         -0.76
noastre          -0.76
noastră          -0.76
rând             -0.76
entendido        -0.72
Positive Logits
con     0.99
pro     0.81
com     0.80
func    0.72
sys     0.69
nomi    0.69
elec    0.65
pre     0.65
ex      0.65
busi    0.64
Activations Density: 0.224%