INDEX
Explanations
references to dialogues and conversations within the text
New Auto-Interp
Negative Logits
:+:
-0.80
*{\-0.77
Bezirks
-0.76
newOwner
-0.74
########.
-0.73
XCTest
-0.72
#+#
-0.72
تضيفلها
-0.72
存于互联网档案馆
-0.69
لاثة
-0.68
POSITIVE LOGITS
dialog
1.94
Dialog
1.80
dialog
1.70
DIALOG
1.68
MatDialog
1.60
DIALOG
1.60
Dialog
1.55
dialogue
1.46
dialogues
1.36
Dialogue
1.36
Activations Density 0.041%