INDEX
Explanations
mentions of political affiliations and groups, particularly in the context of lawmakers and legislation
commas that indicate list-like structures or separations in complex information
New Auto-Interp
Negative Logits
teenth
-0.72
washer
-0.66
ield
-0.66
eur
-0.65
ģĸ
-0.63
=#
-0.62
gap
-0.61
asis
-0.59
tha
-0.58
iku
-0.58
POSITIVE LOGITS
including
1.23
including
1.09
particularly
1.04
Including
1.00
especially
0.99
meanwhile
0.97
rejoice
0.96
however
0.96
notably
0.90
moreover
0.89
Activations Density 0.200%