INDEX
Explanations
references to individuals and their roles in discussions or arguments
New Auto-Interp
Negative Logits
utafitiHapana
-0.46
ValueStyle
-0.45
Descripció
-0.42
UnsafeEnabled
-0.38
Tell
-0.37
ху
-0.37
BeginInit
-0.37
astéroïdes
-0.36
verkla
-0.36
انتهای
-0.36
POSITIVE LOGITS
mentioned
1.23
mention
1.23
touched
1.14
mentions
1.12
mentioning
1.10
alluded
1.04
Mention
0.98
mention
0.98
mencionó
0.97
Mentioned
0.96
Activations Density 0.681%