INDEX
Explanations
words related to breaking or exceeding limits or rules
instances of the abbreviation "AF" or variations of "af."
New Auto-Interp
Negative Logits
Fargo
-0.72
Patriarch
-0.69
Mothers
-0.66
DOWN
-0.64
disinfect
-0.64
Mour
-0.63
PASS
-0.63
MER
-0.62
expectancy
-0.61
offer
-0.61
POSITIVE LOGITS
rican
1.26
rica
1.16
icion
1.09
ghan
0.96
riad
0.96
avorite
0.95
athom
0.93
riend
0.90
eties
0.89
eatures
0.87
Activations Density 0.010%