INDEX
Explanations
the word "smart" with a high activation value
instances of the word "smart" used in various contexts
New Auto-Interp
Negative Logits
ĸļ
-0.84
Divinity
-0.82
Reloaded
-0.81
ulhu
-0.76
alogue
-0.71
riott
-0.71
hedral
-0.68
channelAvailability
-0.67
ãĥĺãĥ©
-0.67
akeru
-0.66
POSITIVE LOGITS
ctl
0.90
ling
0.89
ness
0.89
guy
0.88
sonian
0.85
matic
0.85
ly
0.83
ass
0.79
ies
0.78
ening
0.77
Activations Density 0.016%