INDEX
Explanations
phrases emphasizing the importance and necessity of certain actions or precautions
New Auto-Interp
Negative Logits
аки
-0.15
Ridley
-0.15
gem
-0.15
hd
-0.14
仲
-0.14
eken
-0.14
Gem
-0.14
gem
-0.14
æīįèĥ½
-0.14
iken
-0.14
POSITIVE LOGITS
wise
0.34
wise
0.31
Wise
0.30
-wise
0.26
helpful
0.24
wisdom
0.23
advisable
0.23
prudent
0.23
worthwhile
0.23
Wisdom
0.22
Activations Density 0.201%