INDEX
Explanations
questions about specific word usage and phrasing
New Auto-Interp
Negative Logits
…
-0.49
L
-0.44
ground
-0.43
due
-0.43
l
-0.42
e
-0.42
[…]
-0.42
-0.41
ka
-0.40
Mag
-0.39
POSITIVE LOGITS
kaarangay
1.16
MessageOf
1.09
betweenstory
1.00
ویکیپدی
0.98
featureID
0.97
propOrder
0.93
astéroïdes
0.92
richTextPanel
0.92
ьаж
0.91
pinulongan
0.90
Activations Density 0.592%