INDEX
Explanations
mentions of diving or related actions
New Auto-Interp
Negative Logits
nakalista
-0.68
arbeits
-0.66
okam
-0.66
buro
-0.66
+#+#
-0.66
Nasir
-0.64
defecto
-0.64
Brien
-0.64
ladin
-0.63
لول
-0.63
POSITIVE LOGITS
Som
0.96
dive
0.93
Bien
0.88
Som
0.87
diving
0.85
Diving
0.85
Diving
0.84
Bien
0.83
som
0.83
som
0.82
Activations Density 0.214%