INDEX
Explanations
phrases or words in a specific foreign language
occurrences of a specific character or symbol
New Auto-Interp
Negative Logits
ORED
-0.85
Sussex
-0.76
IFIED
-0.73
guiActiveUnfocused
-0.68
Jericho
-0.65
Mayweather
-0.62
actors
-0.61
Bullets
-0.60
URES
-0.58
bearer
-0.58
POSITIVE LOGITS
ä
1.23
inen
1.16
¢
1.10
¶
1.00
·
0.99
ternity
0.98
ki
0.94
hl
0.93
tten
0.90
î
0.90
Activations Density 0.013%