Sample stimuli

[10 sample images from the benchmark stimulus set]

How to use

from brainscore_vision import load_benchmark
benchmark = load_benchmark("MajajHong2015.IT-pls")
score = benchmark(my_model)  # my_model: any model implementing the Brain-Score model interface

Model scores

[Leaderboard: 453 ranked models; scores range from .574 (rank 1) down to .015 (rank 424), with ranks 425–453 unscored (X). Model names were not preserved in this table.]

Benchmark bibtex

@article {Majaj13402,
            author = {Majaj, Najib J. and Hong, Ha and Solomon, Ethan A. and DiCarlo, James J.},
            title = {Simple Learned Weighted Sums of Inferior Temporal Neuronal Firing Rates Accurately Predict Human Core Object Recognition Performance},
            volume = {35},
            number = {39},
            pages = {13402--13418},
            year = {2015},
            doi = {10.1523/JNEUROSCI.5181-14.2015},
            publisher = {Society for Neuroscience},
            abstract = {To go beyond qualitative models of the biological substrate of object recognition, we ask: can a single ventral stream neuronal linking hypothesis quantitatively account for core object recognition performance over a broad range of tasks? We measured human performance in 64 object recognition tests using thousands of challenging images that explore shape similarity and identity preserving object variation. We then used multielectrode arrays to measure neuronal population responses to those same images in visual areas V4 and inferior temporal (IT) cortex of monkeys and simulated V1 population responses. We tested leading candidate linking hypotheses and control hypotheses, each postulating how ventral stream neuronal responses underlie object recognition behavior. Specifically, for each hypothesis, we computed the predicted performance on the 64 tests and compared it with the measured pattern of human performance. All tested hypotheses based on low- and mid-level visually evoked activity (pixels, V1, and V4) were very poor predictors of the human behavioral pattern. However, simple learned weighted sums of distributed average IT firing rates exactly predicted the behavioral pattern. More elaborate linking hypotheses relying on IT trial-by-trial correlational structure, finer IT temporal codes, or ones that strictly respect the known spatial substructures of IT ({\textquotedblleft}face patches{\textquotedblright}) did not improve predictive power.
Although these results do not reject those more elaborate hypotheses, they suggest a simple, sufficient quantitative model: each object recognition task is learned from the spatially distributed mean firing rates (100 ms) of \~{}60,000 IT neurons and is executed as a simple weighted sum of those firing rates.SIGNIFICANCE STATEMENT We sought to go beyond qualitative models of visual object recognition and determine whether a single neuronal linking hypothesis can quantitatively account for core object recognition behavior. To achieve this, we designed a database of images for evaluating object recognition performance. We used multielectrode arrays to characterize hundreds of neurons in the visual ventral stream of nonhuman primates and measured the object recognition performance of \>100 human observers. Remarkably, we found that simple learned weighted sums of firing rates of neurons in monkey inferior temporal (IT) cortex accurately predicted human performance. Although previous work led us to expect that IT would outperform V4, we were surprised by the quantitative precision with which simple IT-based linking hypotheses accounted for human behavior.},
            issn = {0270-6474},
            URL = {https://www.jneurosci.org/content/35/39/13402},
            eprint = {https://www.jneurosci.org/content/35/39/13402.full.pdf},
            journal = {Journal of Neuroscience}}

Ceiling

0.82.

Note that scores are relative to this ceiling.
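The normalization this note describes can be sketched in a few lines. This is an illustrative helper (not Brain-Score's own code); the raw correlation of 0.47 used below is a made-up example value.

```python
# Hedged sketch: how a raw model-to-IT predictivity relates to the
# benchmark ceiling of 0.82. Displayed scores are raw values divided
# by this ceiling.
CEILING = 0.82

def ceiled_score(raw_correlation: float, ceiling: float = CEILING) -> float:
    """Normalize a raw predictivity value by the benchmark ceiling."""
    return raw_correlation / ceiling

# A hypothetical raw correlation of 0.47 corresponds to a displayed
# score of roughly .573 relative to the 0.82 ceiling.
print(round(ceiled_score(0.47), 3))  # → 0.573
```

A model that predicted the recordings as well as the data predict themselves (raw = 0.82) would score exactly 1.0 on this scale.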

Data: MajajHong2015.IT

Neural recordings for 2560 stimuli from 168 sites in IT

Metric: pls