Model Database
Models
Datasets
Spaces
Docs
Solutions
Pricing
Log In
Sign Up
microsoft
/
phi-1_5
like
665
Text Generation
Transformers
PyTorch
English
mixformer-sequential
custom_code
arxiv:
2309.05463
License:
other
Model card
Files
Files and versions
Community
25
Train
Use in Transformers
main
phi-1_5
4 contributors
History:
28 commits
gugarosa
winglian
add _no_split_modules property (
#17
)
4a426d8
5 days ago
.gitattributes
1.52 kB
initial commit
11 days ago
README.md
7.62 kB
Update README.md
7 days ago
Research License.docx
38.9 kB
Upload Research License.docx
10 days ago
added_tokens.json
1.08 kB
Upload tokenizer
10 days ago
config.json
880 Bytes
Upload MixFormerSequentialForCausalLM
10 days ago
configuration_mixformer_sequential.py
2.24 kB
Upload MixFormerSequentialForCausalLM
10 days ago
generation_config.json
69 Bytes
Upload MixFormerSequentialForCausalLM
10 days ago
merges.txt
456 kB
Upload tokenizer
10 days ago
modeling_mixformer_sequential.py
32.2 kB
add _no_split_modules property (#17)
5 days ago
pytorch_model.bin
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"torch.HalfStorage"
,
"collections.OrderedDict"
What is a pickle import?
2.84 GB
LFS
Upload MixFormerSequentialForCausalLM
10 days ago
special_tokens_map.json
99 Bytes
Upload tokenizer
10 days ago
tokenizer.json
2.11 MB
Upload tokenizer
10 days ago
tokenizer_config.json
237 Bytes
Upload tokenizer
10 days ago
vocab.json
798 kB
Upload tokenizer
10 days ago