SOL v0.5.3 Documentation > Frameworks > PyTorch

PyTorch

This example requires the torchvision package: https://github.com/pytorch/vision/ . Please note, that SOL does not support the use of model.eval() or model.train(). SOL always assumes model.eval() for running inference, and model.train() when running training.

In v0.5.1 we added an lazy evaluation of sol.optimize(...) which removes the necessity to provide an example input. The model instead gets created the first time it gets executed.

import torch
import sol
import torchvision.models as models

''' Optimizing Model '''
py_model  = models.__dict__["alexnet"]()
input	  = torch.rand(32, 3, 224, 224)

# Use vdims=[True] if you plan to use changing batchsizes
sol_model = sol.optimize(py_model, vdims=[True])

''' Run training '''
sol_model.train()

# You cannot initialize the optimizer at this point. You need to wait until
# you have executed the model at least once, so SOL has compiled it.
optimizer = None
for batch in ...:
	input, target = ...
	output = sol_model(input)
	loss = loss_func(output, target)
	# After running the model once, you can safely initialize the optimizer
	if optimizer is None:
		optimizer = torch.optim.Adam(sol_model.parameters(), ...)
	optimizer.zero_grad()
	loss.backward()
	optimizer.step()
	...

''' Run validation '''
sol_model.eval()
with torch.no_grad():
	for batch in ...:
		input = ...
		output = sol_model(input)
		...

F.A.Q.

How do I store/load a Pytorch model?

How do I store/load a Pytorch model?
For storing/loading a SOL PyTorch model, use `model.state_dict()` and `model.load_state_dict(...)` methods. `# Storing sol_model = sol.optimize(pytorch_model, [...]) torch.save(sol_model.state_dict(), PATH) # Loading sol_model = sol.optimize(pytorch_model) sol_model.load_state_dict(torch.load(PATH))` More information on loading/storing the weights can be found here

For storing/loading a SOL PyTorch model, use model.state_dict() and model.load_state_dict(...) methods.

# Storing
sol_model = sol.optimize(pytorch_model, [...])
torch.save(sol_model.state_dict(), PATH)

# Loading
sol_model = sol.optimize(pytorch_model)
sol_model.load_state_dict(torch.load(PATH))

More information on loading/storing the weights can be found here

Can I use `torch.compile(...)` with SOL?
Yes, with SOL ≥ v0.5.2 and PyTorch ≥ 2.0 you can use `torch.compile(model, backend='sol')` with SOL! But it provides less features than using `sol.optimize(...)`, e.g., you cannot specify the `vdims`. Instead `vdims=[True]` is used by default. You also need to manually import `import sol.pytorch` to ensure that SOL is correctly registered as backend into PyTorch. This support is still experimental!

Can I use torch.compile(...) with SOL?

Yes, with SOL ≥ v0.5.2 and PyTorch ≥ 2.0 you can use torch.compile(model, backend='sol') with SOL! But it provides less features than using sol.optimize(...), e.g., you cannot specify the vdims. Instead vdims=[True] is used by default. You also need to manually import import sol.pytorch to ensure that SOL is correctly registered as backend into PyTorch. This support is still experimental!

I get strange errors when running sol.optimize(model, ...), e.g., in Huggingface Transformers.

I get strange errors when running sol.optimize(model, ...), e.g., in Huggingface Transformers.
Huggingface Transformers are incompatible to PyTorch's `torch.jit.script(...)` parser and can only be used with `torch.jit.trace(...)` (see here). As `torch.jit.trace(...)` is much more restricted than `torch.jit.script(...)` in terms of the input and output the models we use `torch.jit.script(...)` as default parser. If you encounter problems, you can try running `sol.optimize(model, ..., trace=True)` to use the `torch.jit.trace(...)` parser instead. But be advised, that you might need to simplify your model input/output accordingly. Please see the PyTorch documentation for more details. The arguments `strict` and `check_trace` are passed to `torch.jit.trace(...)` and are False by default.

Huggingface Transformers are incompatible to PyTorch's torch.jit.script(...) parser and can only be used with torch.jit.trace(...) (see here). As torch.jit.trace(...) is much more restricted than torch.jit.script(...) in terms of the input and output the models we use torch.jit.script(...) as default parser. If you encounter problems, you can try running

sol.optimize(model, ...,
trace=True)

to use the torch.jit.trace(...) parser instead. But be advised, that you might need to simplify your model input/output accordingly. Please see the PyTorch documentation for more details. The arguments strict and check_trace are passed to torch.jit.trace(...) and are False by default.

How can I update/downgrade to another PyTorch version?
Before switching version, please have a look at the compatibility list if your PyTorch version is supported by SOL. If yes, and if you are using SOL with the NEC SX-Aurora TSUBASA, you can switch PyTorch using `pip3 install veda-pytorch~={VERSION}`. If you are using SOL with any other device, then you can just use `pip3 install torch~={VERSION}`.

The SOL model returns more outputs than the PyTorch model.

The SOL model returns more outputs than the PyTorch model.
This error occurs, i.e., in TorchVisions Inception V3 or GoogleNet. These models return 1 output in inference and 2 outputs in training mode. SOL relies on the TorchScript parser. Unfortunately the TorchVision models are build in a way that hides the change of output behavior from TorchScript. However, you can implement this yourself as follows: from torchvision import models class Wrap(torch.nn.Module): def __init__(self, model): super().__init__() self.model = model def forward(self, x): out = self.model(x) if torch.jit.is_scripting(): return (out[0], out[1]) if self.training else (out[0], None) return (out[0], out[1]) if self.training else (out, None) model = Wrap(models.inception_v3()) # use only one output model.training = False sol_model = sol.optimize(model, ...) # use two outputs model.training = True sol_model = sol.optimize(model, ...) SOL currently does not support to dynamically switch between these two modes and requires to compile the model for each mode separately.

This error occurs, i.e., in TorchVisions Inception V3 or GoogleNet. These models return 1 output in inference and 2 outputs in training mode. SOL relies on the TorchScript parser. Unfortunately the TorchVision models are build in a way that hides the change of output behavior from TorchScript. However, you can implement this yourself as follows:

from torchvision import models

class Wrap(torch.nn.Module):
	def __init__(self, model):
		super().__init__()
		self.model = model

	def forward(self, x):
		out = self.model(x)
		if torch.jit.is_scripting():
			return (out[0], out[1]) if self.training else (out[0], None)
		return (out[0], out[1]) if self.training else (out, None)

model = Wrap(models.inception_v3())

# use only one output
model.training = False
sol_model = sol.optimize(model, ...)

# use two outputs
model.training = True
sol_model = sol.optimize(model, ...)

SOL currently does not support to dynamically switch between these two modes and requires to compile the model for each mode separately.

How can I use Pytorch Lightning with SOL?
You can just pass your Pytorch Lightning model to SOL's `sol.optimize(...)` method. `class ResNet50(pl.LightningModule): def __init__(self): super().__init__() self.model = torchvision.models.resnet50() def forward(self, x): return self.model(x) model = sol.optimize(ResNet50())`

Can I implement custom layers using SOL?
Please refer to Custom Layers.

Supported Layers

Please refer to https://pytorch.org/docs/stable/ for how these functions are used. This documentation only contains which layers, functions and tensor functionality is currently implemented within SOL.

Layers

aten::Bool
aten::Float
aten::Int
aten::IntImplicit
aten::ScalarImplicit
aten::__and__
aten::__contains__
aten::__derive_index
aten::__getitem__
aten::__is__
aten::__isnot__
aten::__not__
aten::__or__
aten::__range_length
aten::_convolution
aten::_set_item
aten::abs
aten::absolute
aten::acos
aten::acosh
aten::adaptive_avg_pool1d
aten::adaptive_avg_pool2d
aten::adaptive_avg_pool3d
aten::adaptive_max_pool1d
aten::adaptive_max_pool2d
aten::adaptive_max_pool3d
aten::add
aten::addbmm
aten::addcdiv
aten::addcmul
aten::addmm
aten::all
aten::alpha_dropout
aten::any
aten::append
aten::arange
aten::arccos
aten::arccosh
aten::arcsin
aten::arcsinh
aten::arctan
aten::arctanh
aten::argmax
aten::argmin
aten::as_tensor
aten::asin
aten::asinh
aten::atan
aten::atanh
aten::avg_pool1d
aten::avg_pool2d
aten::avg_pool3d
aten::baddbmm
aten::batch_norm
aten::bernoulli
aten::bitwise_and
aten::bitwise_left_shift
aten::bitwise_not
aten::bitwise_or
aten::bitwise_right_shift
aten::bitwise_xor
aten::bmm
aten::broadcast_tensors
aten::broadcast_to
aten::cat
aten::ceil
aten::celu
aten::chunk
aten::clamp
aten::clamp_max
aten::clamp_min
aten::clone
aten::complex
aten::concat
aten::constant_pad_nd
aten::contiguous
aten::conv1d
aten::conv2d
aten::conv3d
aten::conv_transpose1d
aten::conv_transpose2d
aten::conv_transpose3d
aten::copy
aten::cos
aten::cosh
aten::cumsum
aten::detach
aten::device
aten::dict
aten::dim
aten::div
aten::divide
aten::dropout
aten::einsum
aten::elu
aten::embedding
aten::empty
aten::eq
aten::equal
aten::erf
aten::exp2
aten::exp
aten::expand
aten::expand_as
aten::expm1
aten::eye
aten::fft_fft2
aten::fft_fft
aten::fft_fftn
aten::fft_hfft
aten::fft_ifft2
aten::fft_ifft
aten::fft_ifftn
aten::fft_ihfft
aten::fft_irfft2
aten::fft_irfft
aten::fft_irfftn
aten::fft_rfft2
aten::fft_rfft
aten::fft_rfftn
aten::fill
aten::flatten
aten::floor
aten::floor_divide
aten::floordiv
aten::fmod
aten::format
aten::frobenius_norm
aten::full
aten::ge
aten::gelu
aten::greater
aten::greater_equal
aten::group_norm
aten::gru
aten::gru_cell
aten::gt
aten::hardshrink
aten::hardsigmoid
aten::hardswish
aten::hardtanh
aten::imag
aten::index
aten::instance_norm
aten::is_autocast_enabled
aten::is_floating_point
aten::isfinite
aten::isinf
aten::isnan
aten::items
aten::l1_loss
aten::layer_norm
aten::le
aten::leaky_relu
aten::len
aten::lift_fresh
aten::linear
aten::list
aten::log10
aten::log1p
aten::log2
aten::log
aten::log_sigmoid
aten::log_softmax
aten::logaddexp2
aten::logaddexp
aten::logical_and
aten::logical_not
aten::logical_or
aten::logical_xor
aten::lstm
aten::lstm_cell
aten::lt
aten::masked_fill
aten::matmul
aten::max
aten::max_pool1d
aten::max_pool1d_with_indices
aten::max_pool2d
aten::max_pool2d_with_indices
aten::max_pool3d
aten::max_pool3d_with_indices
aten::max_unpool1d
aten::max_unpool2d
aten::max_unpool3d
aten::maximum
aten::mean
aten::meshgrid
aten::min
aten::minimum
aten::mm
aten::mse_loss
aten::mul
aten::multiply
aten::narrow
aten::narrow_copy
aten::ne
aten::neg
aten::negative
aten::new_full
aten::norm
aten::not_equal
aten::nuclear_norm
aten::numel
aten::ones
aten::ones_like
aten::pad
aten::percentFormat
aten::permute
aten::pow
aten::prelu
aten::prod
aten::rand
aten::rand_like
aten::randint
aten::randint_like
aten::randn
aten::randn_like
aten::real
aten::reciprocal
aten::relu6
aten::relu
aten::remainder
aten::repeat
aten::reshape
aten::reshape_as
aten::rnn_relu
aten::rnn_relu_cell
aten::rnn_tanh
aten::rnn_tanh_cell
aten::rrelu
aten::rsqrt
aten::rsub
aten::select
aten::selu
aten::sigmoid
aten::sign
aten::silu
aten::sin
aten::sinh
aten::size
aten::slice
aten::smooth_l1_loss
aten::softmax
aten::softmin
aten::softplus
aten::softshrink
aten::split
aten::sqrt
aten::square
aten::squeeze
aten::stack
aten::str
aten::sub
aten::sum
aten::tan
aten::tanh
aten::tensor
aten::tile
aten::to
aten::transpose
aten::tril
aten::triu
aten::unbind
aten::unsqueeze
aten::upsample_bicubic2d
aten::upsample_bilinear2d
aten::upsample_linear1d
aten::upsample_nearest1d
aten::upsample_nearest2d
aten::upsample_nearest3d
aten::upsample_trilinear3d
aten::values
aten::var
aten::view
aten::warn
aten::where
aten::zeros
aten::zeros_like
prim::CallFunction
prim::Constant
prim::CreateObject
prim::DictConstruct
prim::GetAttr
prim::If
prim::ListConstruct
prim::ListIndex
prim::ListUnpack
prim::Loop
prim::NumToTensor
prim::Print
prim::PythonOp
prim::RaiseException
prim::SetAttr
prim::TupleConstruct
prim::TupleIndex
prim::TupleUnpack
prim::Uninitialized
prim::device
prim::dtype
prim::is_nested
prim::isinstance
prim::layout
prim::max
prim::min
prim::type
prim::unchecked_cast

Tested Models

TorchVision

alexnet
convnext_base
convnext_large
convnext_small
convnext_tiny
densenet121
densenet161
densenet169
densenet201
efficientnet_b0
efficientnet_b1
efficientnet_b2
efficientnet_b3
efficientnet_b4
efficientnet_b5
efficientnet_b6
efficientnet_b7
efficientnet_v2_l
efficientnet_v2_m
efficientnet_v2_s
lraspp_mobilenet_v3_large
mnasnet0_5
mnasnet0_75
mnasnet1_0
mnasnet1_3
mobilenet_v2
mobilenet_v3_large
mobilenet_v3_small
quantized_mobilenet_v2
quantized_mobilenet_v3_large
quantized_resnet18
quantized_resnet50
quantized_resnext101_32x8d
quantized_resnext101_64x4d
quantized_shufflenet_v2_x0_5
quantized_shufflenet_v2_x1_0
quantized_shufflenet_v2_x1_5
quantized_shufflenet_v2_x2_0
regnet_x_16gf
regnet_x_1_6gf
regnet_x_32gf
regnet_x_3_2gf
regnet_x_400mf
regnet_x_800mf
regnet_x_8gf
regnet_y_128gf
regnet_y_16gf
regnet_y_1_6gf
regnet_y_32gf
regnet_y_3_2gf
regnet_y_400mf
regnet_y_800mf
regnet_y_8gf
resnet101
resnet152
resnet18
resnet34
resnet50
resnext101_32x8d
resnext101_64x4d
resnext50_32x4d
shufflenet_v2_x0_5
shufflenet_v2_x1_0
shufflenet_v2_x1_5
shufflenet_v2_x2_0
squeezenet1_0
squeezenet1_1
vgg11
vgg11_bn
vgg13
vgg13_bn
vgg16
vgg16_bn
vgg19
vgg19_bn
wide_resnet101_2
wide_resnet50_2

Huggingface

BertForSequenceClassification
BertModel
BloomForCausalLM
BloomModel
DistilBertForTokenClassification
GPT2Model
LlamaModel

TIMM

adv_inception_v3
beit_base_patch16_224
beit_base_patch16_224_in22k
beit_base_patch16_384
beit_large_patch16_224
beit_large_patch16_224_in22k
beit_large_patch16_384
beit_large_patch16_512
beitv2_base_patch16_224
beitv2_base_patch16_224_in22k
beitv2_large_patch16_224
beitv2_large_patch16_224_in22k
botnet26t_256
botnet50ts_256
cait_m36_384
cait_m48_448
cait_s24_224
cait_s24_384
cait_s36_384
cait_xs24_384
cait_xxs24_224
cait_xxs24_384
cait_xxs36_224
cait_xxs36_384
coat_lite_mini
coat_lite_small
coat_lite_tiny
coat_mini
coat_tiny
coatnet_0_224
coatnet_0_rw_224
coatnet_1_224
coatnet_1_rw_224
coatnet_2_224
coatnet_2_rw_224
coatnet_3_224
coatnet_3_rw_224
coatnet_4_224
coatnet_5_224
coatnet_bn_0_rw_224
coatnet_nano_cc_224
coatnet_nano_rw_224
coatnet_pico_rw_224
coatnet_rmlp_0_rw_224
coatnet_rmlp_1_rw_224
coatnet_rmlp_2_rw_224
coatnet_rmlp_3_rw_224
coatnet_rmlp_nano_rw_224
coatnext_nano_rw_224
convmixer_1024_20_ks9_p14
convmixer_1536_20
convmixer_768_32
convnext_atto
convnext_atto_ols
convnext_base
convnext_base_384_in22ft1k
convnext_base_in22ft1k
convnext_base_in22k
convnext_femto
convnext_femto_ols
convnext_large
convnext_large_384_in22ft1k
convnext_large_in22ft1k
convnext_large_in22k
convnext_nano
convnext_nano_ols
convnext_pico
convnext_pico_ols
convnext_small
convnext_small_384_in22ft1k
convnext_small_in22ft1k
convnext_small_in22k
convnext_tiny
convnext_tiny_384_in22ft1k
convnext_tiny_hnf
convnext_tiny_in22ft1k
convnext_tiny_in22k
convnext_xlarge_384_in22ft1k
convnext_xlarge_in22ft1k
convnext_xlarge_in22k
cs3darknet_focus_l
cs3darknet_focus_m
cs3darknet_focus_s
cs3darknet_focus_x
cs3darknet_l
cs3darknet_m
cs3darknet_s
cs3darknet_x
cs3edgenet_x
cs3se_edgenet_x
cs3sedarknet_l
cs3sedarknet_x
cs3sedarknet_xdw
cspdarknet53
cspresnet50
cspresnet50d
cspresnet50w
cspresnext50
darknet17
darknet21
darknet53
darknetaa53
deit3_base_patch16_224
deit3_base_patch16_224_in21ft1k
deit3_base_patch16_384
deit3_base_patch16_384_in21ft1k
deit3_huge_patch14_224
deit3_huge_patch14_224_in21ft1k
deit3_large_patch16_224
deit3_large_patch16_224_in21ft1k
deit3_large_patch16_384
deit3_large_patch16_384_in21ft1k
deit3_medium_patch16_224
deit3_medium_patch16_224_in21ft1k
deit3_small_patch16_224
deit3_small_patch16_224_in21ft1k
deit3_small_patch16_384
deit3_small_patch16_384_in21ft1k
deit_base_distilled_patch16_224
deit_base_distilled_patch16_384
deit_base_patch16_224
deit_base_patch16_384
deit_small_distilled_patch16_224
deit_small_patch16_224
deit_tiny_distilled_patch16_224
deit_tiny_patch16_224
densenet121
densenet121d
densenet161
densenet169
densenet201
densenet264
dla102
dla102x2
dla102x
dla169
dla34
dla46_c
dla46x_c
dla60
dla60_res2net
dla60_res2next
dla60x
dla60x_c
dm_nfnet_f0
dm_nfnet_f1
dm_nfnet_f2
dm_nfnet_f3
dm_nfnet_f4
dm_nfnet_f5
dm_nfnet_f6
dpn107
dpn131
dpn68
dpn68b
dpn92
dpn98
eca_botnext26ts_256
eca_nfnet_l0
eca_nfnet_l1
eca_nfnet_l2
eca_nfnet_l3
eca_resnet33ts
eca_resnext26ts
eca_vovnet39b
ecaresnet101d
ecaresnet101d_pruned
ecaresnet200d
ecaresnet269d
ecaresnet26t
ecaresnet50d
ecaresnet50d_pruned
ecaresnet50t
ecaresnetlight
ecaresnext26t_32x4d
ecaresnext50t_32x4d
edgenext_base
efficientformer_l1
efficientformer_l3
efficientformer_l7
efficientnet_b0
efficientnet_b0_g16_evos
efficientnet_b0_g8_gn
efficientnet_b0_gn
efficientnet_b1
efficientnet_b1_pruned
efficientnet_b2
efficientnet_b2_pruned
efficientnet_b2a
efficientnet_b3
efficientnet_b3_g8_gn
efficientnet_b3_gn
efficientnet_b3_pruned
efficientnet_b3a
efficientnet_b4
efficientnet_b5
efficientnet_b6
efficientnet_b7
efficientnet_b8
efficientnet_cc_b0_4e
efficientnet_cc_b0_8e
efficientnet_cc_b1_8e
efficientnet_el
efficientnet_el_pruned
efficientnet_em
efficientnet_es
efficientnet_es_pruned
efficientnet_l2
efficientnet_lite0
efficientnet_lite1
efficientnet_lite2
efficientnet_lite3
efficientnet_lite4
efficientnetv2_m
efficientnetv2_rw_s
efficientnetv2_rw_t
efficientnetv2_s
efficientnetv2_xl
ens_adv_inception_resnet_v2
ese_vovnet19b_dw
ese_vovnet19b_slim
ese_vovnet19b_slim_dw
ese_vovnet39b
ese_vovnet39b_evos
ese_vovnet57b
ese_vovnet99b
fbnetc_100
fbnetv3_b
fbnetv3_d
fbnetv3_g
gc_efficientnetv2_rw_t
gcresnet33ts
gcresnet50t
gcresnext26ts
gcresnext50ts
gcvit_base
gcvit_small
gcvit_tiny
gcvit_xtiny
gcvit_xxtiny
gernet_l
gernet_m
gernet_s
ghostnet_050
ghostnet_100
ghostnet_130
gluon_inception_v3
gluon_resnet101_v1b
gluon_resnet101_v1c
gluon_resnet101_v1d
gluon_resnet101_v1s
gluon_resnet152_v1b
gluon_resnet152_v1c
gluon_resnet152_v1d
gluon_resnet152_v1s
gluon_resnet18_v1b
gluon_resnet34_v1b
gluon_resnet50_v1b
gluon_resnet50_v1c
gluon_resnet50_v1d
gluon_resnet50_v1s
gluon_resnext101_32x4d
gluon_resnext101_64x4d
gluon_resnext50_32x4d
gluon_senet154
gluon_seresnext101_32x4d
gluon_seresnext101_64x4d
gluon_seresnext50_32x4d
gluon_xception65
gmixer_12_224
gmixer_24_224
gmlp_b16_224
gmlp_s16_224
gmlp_ti16_224
hardcorenas_a
hardcorenas_b
hardcorenas_c
hardcorenas_d
hardcorenas_e
hardcorenas_f
hrnet_w18
hrnet_w18_small
hrnet_w18_small_v2
hrnet_w30
hrnet_w32
hrnet_w40
hrnet_w44
hrnet_w48
hrnet_w64
ig_resnext101_32x16d
ig_resnext101_32x32d
ig_resnext101_32x48d
ig_resnext101_32x8d
inception_resnet_v2
inception_v3
inception_v4
jx_nest_base
jx_nest_small
jx_nest_tiny
lambda_resnet26t
lambda_resnet50ts
lcnet_035
lcnet_050
lcnet_075
lcnet_100
lcnet_150
legacy_senet154
legacy_seresnet101
legacy_seresnet18
legacy_seresnet34
legacy_seresnet50
legacy_seresnext101_32x4d
legacy_seresnext26_32x4d
legacy_seresnext50_32x4d
maxvit_nano_rw_256
maxvit_pico_rw_256
maxvit_rmlp_nano_rw_256
maxvit_rmlp_pico_rw_256
maxvit_rmlp_small_rw_224
maxvit_rmlp_small_rw_256
maxvit_rmlp_tiny_rw_256
maxvit_small_224
maxvit_tiny_224
maxvit_tiny_rw_224
maxvit_tiny_rw_256
maxxvit_rmlp_nano_rw_256
maxxvit_rmlp_small_rw_256
maxxvit_rmlp_tiny_rw_256
mixer_b16_224
mixer_b16_224_in21k
mixer_b16_224_miil
mixer_b16_224_miil_in21k
mixer_b32_224
mixer_l16_224
mixer_l16_224_in21k
mixer_l32_224
mixer_s16_224
mixer_s32_224
mixnet_l
mixnet_m
mixnet_s
mixnet_xl
mixnet_xxl
mnasnet_050
mnasnet_075
mnasnet_100
mnasnet_140
mnasnet_a1
mnasnet_b1
mnasnet_small
mobilenetv2_035
mobilenetv2_050
mobilenetv2_075
mobilenetv2_100
mobilenetv2_110d
mobilenetv2_120d
mobilenetv2_140
mobilenetv3_large_075
mobilenetv3_large_100
mobilenetv3_large_100_miil
mobilenetv3_large_100_miil_in21k
mobilenetv3_rw
mobilenetv3_small_050
mobilenetv3_small_075
mobilenetv3_small_100
mobilevit_s
mobilevit_xs
mobilevit_xxs
mobilevitv2_050
mobilevitv2_075
mobilevitv2_100
mobilevitv2_125
mobilevitv2_150
mobilevitv2_150_384_in22ft1k
mobilevitv2_150_in22ft1k
mobilevitv2_175
mobilevitv2_175_384_in22ft1k
mobilevitv2_175_in22ft1k
mobilevitv2_200
mobilevitv2_200_384_in22ft1k
mobilevitv2_200_in22ft1k
nasnetalarge
nest_base
nest_base
nest_small
nest_tiny
nf_ecaresnet101
nf_ecaresnet26
nf_ecaresnet50
nf_regnet_b0
nf_regnet_b1
nf_regnet_b2
nf_regnet_b3
nf_regnet_b4
nf_regnet_b5
nf_resnet101
nf_resnet26
nf_resnet50
nf_seresnet101
nf_seresnet26
nf_seresnet50
nfnet_f0
nfnet_f1
nfnet_f2
nfnet_f3
nfnet_f4
nfnet_f5
nfnet_f6
nfnet_f7
nfnet_l0
pnasnet5large
poolformer_m36
poolformer_m48
poolformer_s12
poolformer_s24
poolformer_s36
pvt_v2_b0
pvt_v2_b1
pvt_v2_b2
pvt_v2_b2_li
pvt_v2_b3
pvt_v2_b4
pvt_v2_b5
pvt_v2_b5
regnetv_040
regnetv_064
regnetx_002
regnetx_004
regnetx_006
regnetx_008
regnetx_016
regnetx_032
regnetx_040
regnetx_064
regnetx_080
regnetx_120
regnetx_160
regnetx_320
regnety_002
regnety_004
regnety_006
regnety_008
regnety_016
regnety_032
regnety_040
regnety_040s_gn
regnety_064
regnety_080
regnety_120
regnety_160
regnety_320
regnetz_005
regnetz_040
regnetz_040h
regnetz_b16
regnetz_b16_evos
regnetz_c16
regnetz_c16_evos
regnetz_d32
regnetz_d8
regnetz_d8_evos
regnetz_e8
repvgg_a2
repvgg_b0
repvgg_b1
repvgg_b1g4
repvgg_b2
repvgg_b2g4
repvgg_b3
repvgg_b3g4
res2net101_26w_4s
res2net50_14w_8s
res2net50_26w_4s
res2net50_26w_6s
res2net50_26w_8s
res2net50_48w_2s
res2next50
resmlp_12_224
resmlp_12_224_dino
resmlp_12_distilled_224
resmlp_24_224
resmlp_24_224_dino
resmlp_24_distilled_224
resmlp_36_224
resmlp_36_distilled_224
resmlp_big_24_224
resmlp_big_24_224_in22ft1k
resmlp_big_24_distilled_224
resnest101e
resnest14d
resnest200e
resnest269e
resnest26d
resnest50d
resnest50d_1s4x24d
resnest50d_4s2x40d
resnet101
resnet101d
resnet10t
resnet14t
resnet152
resnet152d
resnet18
resnet18d
resnet200
resnet200d
resnet26
resnet26d
resnet26t
resnet32ts
resnet33ts
resnet34
resnet34d
resnet50
resnet50_gn
resnet50d
resnet50q
resnet50t
resnet51q
resnet61q
resnetaa101d
resnetaa50
resnetaa50d
resnetrs101
resnetrs152
resnetrs200
resnetrs270
resnetrs350
resnetrs420
resnetrs50
resnetv2_101
resnetv2_101d
resnetv2_101x1_bitm
resnetv2_101x1_bitm_in21k
resnetv2_101x3_bitm
resnetv2_101x3_bitm_in21k
resnetv2_152
resnetv2_152d
resnetv2_152x2_bit_teacher
resnetv2_152x2_bit_teacher_384
resnetv2_152x2_bitm
resnetv2_152x2_bitm_in21k
resnetv2_152x4_bitm
resnetv2_152x4_bitm_in21k
resnetv2_50
resnetv2_50d
resnetv2_50d_evos
resnetv2_50d_frn
resnetv2_50d_gn
resnetv2_50t
resnetv2_50x1_bit_distilled
resnetv2_50x1_bitm
resnetv2_50x1_bitm_in21k
resnetv2_50x3_bitm
resnetv2_50x3_bitm_in21k
resnext101_32x4d
resnext101_32x8d
resnext101_64x4d
resnext26ts
resnext50_32x4d
resnext50d_32x4d
rexnet_100
rexnet_130
rexnet_150
rexnet_200
rexnetr_100
rexnetr_130
rexnetr_150
rexnetr_200
sebotnet33ts_256
sebotnet33ts_256
sedarknet21
selecsls42
selecsls42b
selecsls60
selecsls60b
selecsls84
semnasnet_050
semnasnet_075
semnasnet_100
semnasnet_140
semobilevit_s
senet154
sequencer2d_l
sequencer2d_m
sequencer2d_s
seresnet101
seresnet152
seresnet152d
seresnet18
seresnet200d
seresnet269d
seresnet33ts
seresnet34
seresnet50
seresnet50t
seresnetaa50d
seresnext101_32x4d
seresnext101_32x8d
seresnext101d_32x8d
seresnext26d_32x4d
seresnext26t_32x4d
seresnext26tn_32x4d
seresnext26ts
seresnext50_32x4d
seresnextaa101d_32x8d
skresnet18
skresnet34
skresnet50
skresnet50d
skresnext50_32x4d
spnasnet_100
ssl_resnet18
ssl_resnet50
ssl_resnext101_32x16d
ssl_resnext101_32x4d
ssl_resnext101_32x8d
ssl_resnext50_32x4d
swsl_resnet18
swsl_resnet50
swsl_resnext101_32x16d
swsl_resnext101_32x4d
swsl_resnext101_32x8d
swsl_resnext50_32x4d
tf_efficientnet_b0
tf_efficientnet_b0_ap
tf_efficientnet_b0_ns
tf_efficientnet_b1
tf_efficientnet_b1_ap
tf_efficientnet_b1_ns
tf_efficientnet_b2
tf_efficientnet_b2_ap
tf_efficientnet_b2_ns
tf_efficientnet_b3
tf_efficientnet_b3_ap
tf_efficientnet_b3_ns
tf_efficientnet_b4
tf_efficientnet_b4_ap
tf_efficientnet_b4_ns
tf_efficientnet_b5
tf_efficientnet_b5_ap
tf_efficientnet_b5_ns
tf_efficientnet_b6
tf_efficientnet_b6_ap
tf_efficientnet_b6_ns
tf_efficientnet_b7
tf_efficientnet_b7_ap
tf_efficientnet_b7_ns
tf_efficientnet_b8
tf_efficientnet_b8_ap
tf_efficientnet_cc_b0_4e
tf_efficientnet_cc_b0_8e
tf_efficientnet_cc_b1_8e
tf_efficientnet_el
tf_efficientnet_em
tf_efficientnet_es
tf_efficientnet_l2_ns
tf_efficientnet_l2_ns_475
tf_efficientnet_lite0
tf_efficientnet_lite1
tf_efficientnet_lite2
tf_efficientnet_lite3
tf_efficientnet_lite4
tf_efficientnetv2_b0
tf_efficientnetv2_b1
tf_efficientnetv2_b2
tf_efficientnetv2_b3
tf_efficientnetv2_m
tf_efficientnetv2_m_in21ft1k
tf_efficientnetv2_s
tf_efficientnetv2_s_in21ft1k
tf_efficientnetv2_s_in21k
tf_inception_v3
tf_mixnet_l
tf_mixnet_m
tf_mixnet_s
tf_mobilenetv3_large_075
tf_mobilenetv3_large_100
tf_mobilenetv3_large_minimal_100
tf_mobilenetv3_small_075
tf_mobilenetv3_small_100
tf_mobilenetv3_small_minimal_100
tinynet_a
tinynet_b
tinynet_c
tinynet_d
tinynet_e
tresnet_l
tresnet_l_448
tresnet_m
tresnet_m_448
tresnet_m_miil_in21k
tresnet_v2_l
tresnet_xl
tresnet_xl_448
tv_densenet121
tv_resnet101
tv_resnet152
tv_resnet34
tv_resnet50
tv_resnext50_32x4d
twins_pcpvt_base
twins_pcpvt_large
twins_pcpvt_small
twins_svt_base
twins_svt_large
twins_svt_small
vgg11
vgg11_bn
vgg13
vgg13_bn
vgg16
vgg16_bn
vgg19
vgg19_bn
visformer_small
visformer_tiny
vit_base_patch16_18x2_224
vit_base_patch16_224
vit_base_patch16_224_dino
vit_base_patch16_224_in21k
vit_base_patch16_224_miil
vit_base_patch16_224_miil_in21k
vit_base_patch16_224_sam
vit_base_patch16_384
vit_base_patch16_plus_240
vit_base_patch16_rpn_224
vit_base_patch32_224
vit_base_patch32_224_clip_laion2b
vit_base_patch32_224_in21k
vit_base_patch32_224_sam
vit_base_patch32_384
vit_base_patch32_plus_256
vit_base_patch8_224
vit_base_patch8_224_dino
vit_base_patch8_224_in21k
vit_base_r26_s32_224
vit_base_r50_s16_224
vit_base_r50_s16_224_in21k
vit_base_r50_s16_384
vit_base_resnet26d_224
vit_base_resnet50_224_in21k
vit_base_resnet50_384
vit_base_resnet50d_224
vit_giant_patch14_224
vit_giant_patch14_224_clip_laion2b
vit_gigantic_patch14_224
vit_huge_patch14_224
vit_huge_patch14_224_clip_laion2b
vit_huge_patch14_224_in21k
vit_large_patch14_224
vit_large_patch14_224_clip_laion2b
vit_large_patch16_224
vit_large_patch16_224_in21k
vit_large_patch16_384
vit_large_patch32_224
vit_large_patch32_224_in21k
vit_large_patch32_384
vit_large_r50_s32_224
vit_large_r50_s32_224_in21k
vit_large_r50_s32_384
vit_relpos_base_patch16_224
vit_relpos_base_patch16_cls_224
vit_relpos_base_patch16_clsgap_224
vit_relpos_base_patch16_plus_240
vit_relpos_base_patch16_rpn_224
vit_relpos_base_patch32_plus_rpn_256
vit_relpos_medium_patch16_224
vit_relpos_medium_patch16_cls_224
vit_relpos_medium_patch16_rpn_224
vit_relpos_small_patch16_224
vit_relpos_small_patch16_rpn_224
vit_small_patch16_18x2_224
vit_small_patch16_224
vit_small_patch16_224_dino
vit_small_patch16_224_in21k
vit_small_patch16_36x1_224
vit_small_patch16_384
vit_small_patch32_224
vit_small_patch32_224_in21k
vit_small_patch32_384
vit_small_patch8_224_dino
vit_small_r26_s32_224
vit_small_r26_s32_224_in21k
vit_small_r26_s32_384
vit_small_resnet26d_224
vit_small_resnet50d_s16_224
vit_srelpos_medium_patch16_224
vit_srelpos_small_patch16_224
vit_tiny_patch16_224
vit_tiny_patch16_224_in21k
vit_tiny_patch16_384
vit_tiny_r_s16_p8_224
vit_tiny_r_s16_p8_224_in21k
vit_tiny_r_s16_p8_384
vovnet39a
vovnet57a
wide_resnet101_2
wide_resnet50_2
xception41
xception41p
xception65
xception65p
xception71
xception
xcit_large_24_p16_224
xcit_large_24_p16_224_dist
xcit_large_24_p16_384_dist
xcit_large_24_p8_224
xcit_large_24_p8_224_dist
xcit_large_24_p8_384_dist
xcit_medium_24_p16_224
xcit_medium_24_p16_224_dist
xcit_medium_24_p16_384_dist
xcit_medium_24_p8_224
xcit_medium_24_p8_224_dist
xcit_medium_24_p8_384_dist
xcit_nano_12_p16_224
xcit_nano_12_p16_224_dist
xcit_nano_12_p16_384_dist
xcit_nano_12_p8_224
xcit_nano_12_p8_224_dist
xcit_nano_12_p8_384_dist
xcit_small_12_p16_224
xcit_small_12_p16_224_dist
xcit_small_12_p16_384_dist
xcit_small_12_p8_224
xcit_small_12_p8_224_dist
xcit_small_12_p8_384_dist
xcit_small_24_p16_224
xcit_small_24_p16_224_dist
xcit_small_24_p16_384_dist
xcit_small_24_p8_224
xcit_small_24_p8_224_dist
xcit_small_24_p8_384_dist
xcit_tiny_12_p16_224
xcit_tiny_12_p16_224_dist
xcit_tiny_12_p16_384_dist
xcit_tiny_12_p8_224
xcit_tiny_12_p8_224_dist
xcit_tiny_12_p8_384_dist
xcit_tiny_24_p16_224
xcit_tiny_24_p16_224_dist
xcit_tiny_24_p16_384_dist
xcit_tiny_24_p8_224
xcit_tiny_24_p8_224_dist
xcit_tiny_24_p8_384_dist