Explanations
phrases that indicate locations or conditions
Negative Logits
Token    Logit
ρης    -0.73
متعلقه    -0.70
utnik    -0.68
ScopeManager    -0.65
EndProject    -0.64
Hentet    -0.63
/**    -0.63
transfieras    -0.62
Ծանոթ    -0.60
gynhyrchwyd    -0.59
Positive Logits
Token    Logit
where    1.02
Where    0.99
Where    0.98
WHERE    0.91
where    0.87
Onde    0.81
WHERE    0.76
donde    0.74
где    0.72
onde    0.72
Activations Density 0.179%