INDEX
Explanations
special characters or symbols in the text
New Auto-Interp
Negative Logits
č
-0.19
Behaviour
-0.16
behaviour
-0.14
Behaviour
-0.14
č
-0.14
î
-0.13
â
-0.13
âm
-0.13
jewellery
-0.13
à¸ĸม
-0.13
POSITIVE LOGITS
Pope
0.21
Vatican
0.20
Francis
0.19
Trump
0.18
Franc
0.17
Benedict
0.17
Rome
0.17
Francisco
0.16
Numero
0.15
Cardinals
0.15
Activations Density 0.001%