INDEX
Explanations
features and descriptions of diverse topics or items
Negative Logits
فريبيس    -0.83
carina    -0.68
ngOnInit    -0.59
<bos>    -0.56
lück    -0.53
kecil    -0.53
تانيه    -0.52
htdocs    -0.52
deschis    -0.52
собенно    -0.51
Positive Logits
includes    1.13
include    1.08
Includes    1.05
Includes    1.03
includes    0.96
INCLUDES    0.94
Include    0.93
INCLUDES    0.92
a    0.90
comprises    0.89
Activations Density 0.462%