Commit Graph

9 Commits (2703d155c88d27bba9a1f465f5489a7947ffc313)

Author | SHA1 | Message | Date
Ross Wightman | 82ae247879 | MambaOut weights on hub, configs finalized | 2024-10-11 11:07:40 -07:00
Ross Wightman | 7efb60c299 | Add first_conv for mambaout | 2024-10-09 14:11:40 -07:00
Ross Wightman | 5dc5ee5b42 | Add global_pool to mambaout __init__ and pass to heads | 2024-10-09 14:11:40 -07:00
Ross Wightman | 9d1dfe8dbe | Incorrectly named head_hidden_size | 2024-10-09 14:11:40 -07:00
Ross Wightman | 91e743f2dd | Mambaout tweaks | 2024-10-09 14:11:40 -07:00
Ross Wightman | 4542cf03f9 | Add features_only, other bits to mambaout, define different base alternatives | 2024-10-09 14:11:40 -07:00
Ross Wightman | c2da12c7e1 | Update rw models, fix heads | 2024-10-09 14:11:40 -07:00
Ross Wightman | f2086f51a0 | Add mambaout builder support, pretrained weight remap | 2024-10-09 14:11:40 -07:00
Ross Wightman | c6ef54eefa | Initial mambaout work | 2024-10-09 14:11:40 -07:00