INDEX
Explanations
Filipino words related to government, such as titles, names, and locations
New Auto-Interp
Negative Logits
IAL
-0.81
AX
-0.80
ington
-0.78
ees
-0.77
APH
-0.77
FX
-0.74
atories
-0.74
ipal
-0.73
oids
-0.73
furt
-0.73
POSITIVE LOGITS
lang
0.87
tha
0.84
bab
0.78
mah
0.72
opio
0.69
sar
0.68
bes
0.68
la
0.66
sher
0.66
buck
0.65
Activations Density 0.057%