INDEX
Explanations
verbs that indicate action or activity
statements about capabilities and functions
New Auto-Interp
Negative Logits
lie
-0.72
ã
-0.70
oppy
-0.66
ilater
-0.65
igl
-0.63
osaurus
-0.62
1111
-0.62
aides
-0.59
jac
-0.58
INT
-0.58
POSITIVE LOGITS
prototype
0.74
£ı
0.73
tesy
0.72
founder
0.71
ONSORED
0.66
trending
0.66
partnered
0.65
premier
0.65
Plugin
0.63
utenberg
0.61
Activations Density 0.567%