Explanations
expressions of personal views and perspectives on various topics
Negative Logits
bietern           -0.55
AssemblyProduct   -0.48
TagHelper         -0.47
urm               -0.46
Pratique          -0.45
partimento        -0.44
таратура          -0.42
ťou               -0.42
ResourceManager   -0.41
proper            -0.41
Positive Logits
opinions          1.04
opinion           1.00
Opinions          0.99
opinions          0.93
conclusions       0.90
thoughts          0.90
Opinions          0.89
للمعارف           0.87
Opinion           0.85
comments          0.84
Activations Density: 0.612%