INDEX
Explanations
a variety of activation values indicating potential relevance in contextual phrases
New Auto-Interp
Negative Logits
Waray
-0.69
abetes
-0.64
viewType
-0.58
Tembelea
-0.58
|
-0.57
doInBackground
-0.56
VersionUID
-0.55
[
-0.55
(
-0.52
()
-0.52
POSITIVE LOGITS
<eos>
1.89
<unused60>
1.19
<unused63>
1.17
<unused61>
1.09
</em>
0.81
</code>
0.78
</i>
0.71
</u>
0.64
</blockquote>
0.58
Portály
0.55
Activations Density 0.486%