INDEX
Explanations
verbs related to human behavior
instances of the word "behave" and its variations
New Auto-Interp
Negative Logits
Solo
-0.73
occup
-0.69
export
-0.68
fram
-0.66
user
-0.65
ondo
-0.64
source
-0.64
ixed
-0.63
Herz
-0.63
val
-0.62
POSITIVE LOGITS
behave
1.11
behaved
1.08
behaves
1.07
behaving
0.93
behavi
0.93
iments
0.93
behav
0.89
wcsstore
0.87
behaviours
0.86
uations
0.86
Activations Density 0.010%