INDEX
Negative Logits
seves
1.14
Their
1.02
их
0.97
Their
0.95
它们的
0.92
Architect
0.91
Config
0.91
How
0.90
loro
0.89
Configurations
0.88
POSITIVE LOGITS
familiarity
1.09
thoughtful
1.07
adherence
1.07
unwillingness
1.07
firepower
1.07
ammonia
1.05
scrutiny
1.05
intimidation
1.04
aeration
1.02
sarcasm
1.02
Activations Density 0.511%