INDEX
Explanations
assertions or beliefs about people's actions or characteristics
New Auto-Interp
Negative Logits
therefore
-0.63
daher
-0.56
therefore
-0.55
+#+
-0.54
todėl
-0.53
tuttavia
-0.52
übrigens
-0.51
hence
-0.51
Therefore
-0.50
however
-0.50
POSITIVE LOGITS
таратура
0.72
mergeFrom
0.70
modelBuilder
0.69
'}>
0.67
"]];
0.66
"}>
0.65
__":
0.65
."));
0.65
])):
0.64
المعيارى
0.64
Activations Density 0.361%