INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
reserv
-0.70
iors
-0.68
Reserve
-0.66
ality
-0.66
favour
-0.64
Committees
-0.64
iod
-0.63
favor
-0.63
arity
-0.62
Ratio
-0.61
POSITIVE LOGITS
Adams
0.83
00200000
0.78
âĶĢâĶĢâĶĢâĶĢ
0.75
encer
0.74
âĩ
0.72
è£ıè¦ļéĨĴ
0.70
inez
0.69
osta
0.68
POLITICO
0.68
?]
0.67
Activations Density 0.000%
No Known Activations
This feature has no known activations.