Explanations
expressions and phrases that describe subjective experiences or perceptions
Negative Logits
Token      Logit
Similar    -0.90
Similar    -0.84
Kinds      -0.84
similar    -0.82
Kind       -0.75
Kinds      -0.70
kinds      -0.69
Types      -0.69
KIND       -0.69
Types      -0.65
Positive Logits
Token           Logit
lenker          0.57
lipop           0.54
li              0.52
slidesToShow    0.50
matchCondition  0.50
ようになった    0.49
pinMode         0.49
liked           0.49
edan            0.48
ようになる      0.48
Activations Density 0.125%
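
As a rough illustration of the density figure above: activation density is commonly read as the fraction of token positions on which a feature fires at all. The sketch below assumes that reading; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from this page or from any particular dashboard's implementation.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature is
    active (activation > threshold). This is an illustrative definition, not
    necessarily the one used to produce the figure on this page.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Made-up data: 800 token positions, with the feature firing on exactly one.
sample = [0.0] * 799 + [3.2]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.125% corresponds to the feature being active on about 1 in every 800 token positions.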