INDEX
Explanations
phrases and terms related to intentions and objectives
New Auto-Interp
Negative Logits
principalTable
-0.93
विश्वसनीयता
-0.91
MainAxisSize
-0.85
PhysRev
-0.83
ligiloj
-0.80
चीज़ों
-0.77
EndContext
-0.76
OGND
-0.76
متعلقه
-0.74
Aspiration
-0.74
POSITIVE LOGITS
allowed
0.59
think
0.55
aimed
0.53
allow
0.49
aim
0.49
us
0.47
put
0.46
体
0.46
made
0.45
<bos>
0.45
Activations Density 0.139%