Commit Graph

  • 85ad1b9ceb Automatically created from 모델 배포(613:@v2-phi-1_5) by 그룹사용자(groupuser) refs/deployment/triton groupuser 2025-11-24 02:24:13 +0000
  • 0107830dfb Automatically created from 모델 배포(613:@v2-phi-1_5) by 그룹사용자(groupuser) groupuser 2025-11-24 02:24:12 +0000
  • bbdd15925a initial empty branch groupuser 2025-11-24 02:24:10 +0000
  • 638b75ac90
    Expose metadata link to next version of the model Daniel van Strien 2024-09-27 12:44:45 +0000
  • 847547f158
    Adding Evaluation Results Open LLM Leaderboard PR Bot 2024-06-12 19:03:00 +0000
  • 675aa382d8
    fix(config): Removes auto_map since it is not used anymore. main Gustavo de Rosa 2024-04-29 16:16:33 +0000
  • db561377f8
    Delete modeling_phi.py Gustavo de Rosa 2024-04-23 14:19:05 +0000
  • de9f725f6a
    Delete configuration_phi.py Gustavo de Rosa 2024-04-23 14:18:57 +0000
  • 467adac814
    Update README.md Gustavo de Rosa 2024-04-23 14:18:44 +0000
  • 474b29ef61
    Delete pytorch_model.bin Gustavo de Rosa 2024-04-17 13:35:56 +0000
  • fa2a356ff2
    Adding safetensors variant of this model (#82) Gustavo de Rosa 2024-04-17 13:35:13 +0000
  • 7e19e91fd0
    Adding safetensors variant of this model Safetensors convertbot 2024-03-27 11:23:31 +0000
  • 4f74ac4e83
    update bos_token_id and eos_token_id Chujie Zheng 2024-03-04 21:25:50 +0000
  • bffd3b29c4
    Update LICENSE Gustavo de Rosa 2024-02-06 12:36:39 +0000
  • 349cf8b5e8
    Update README.md Gustavo de Rosa 2024-01-24 13:34:13 +0000
  • 83b9c52637
    Update README.md Gustavo de Rosa 2024-01-22 12:25:40 +0000
  • 675e8c1bae
    Update config.json Gustavo de Rosa 2024-01-22 12:25:27 +0000
  • 34a1490e06
    Update modeling_phi.py Gustavo de Rosa 2024-01-16 16:05:38 +0000
  • 59e722d14e
    Update README.md Gustavo de Rosa 2024-01-16 14:56:49 +0000
  • 426ea900b0
    Update modeling_phi.py Gustavo de Rosa 2024-01-15 14:26:10 +0000
  • 81f4b01b44
    Adding safetensors variant of this model Safetensors convertbot 2024-01-12 14:20:32 +0000
  • 3edb5e62c4
    Update modeling_phi.py Gustavo de Rosa 2024-01-12 00:44:23 +0000
  • e0f03c4877
    Update modeling_phi.py Gustavo de Rosa 2024-01-11 16:40:17 +0000
  • 051d15f1e7
    Update config.json Gustavo de Rosa 2024-01-11 11:22:42 +0000
  • 914c8fb3c6 Upload modeling_phi.py Gustavo de Rosa 2024-01-10 13:54:40 +0000
  • 3a705a2d6b Delete Research License.docx Gustavo de Rosa 2024-01-10 13:16:00 +0000
  • 341a17a8f2 Upload 5 files Gustavo de Rosa 2024-01-10 13:15:50 +0000
  • 1dc35eb2f5 Update README.md (#69) Gustavo de Rosa 2024-01-10 11:29:00 +0000
  • 8584061b4d Update README.md Mojan Javaheripi 2024-01-10 00:09:19 +0000
  • 41217aafb5 Update config.json Gustavo de Rosa 2024-01-08 17:13:22 +0000
  • d3ba318b78 chore(root): Updates files to internal transformers implementation. Gustavo de Rosa 2024-01-08 13:12:24 +0000
  • 5308c1fd8c Adding safetensors variant of this model Safetensors convertbot 2023-12-15 01:22:09 +0000
  • 24f9ea14df Update README.md Gustavo de Rosa 2023-12-13 23:24:09 +0000
  • d262514668 Upload 4 files Gustavo de Rosa 2023-12-13 23:19:24 +0000
  • f27cd936bd Update README.md Gustavo de Rosa 2023-12-13 23:01:12 +0000
  • 80c0ba9f8e Update README.md Gustavo de Rosa 2023-12-13 22:44:59 +0000
  • a286f5c1de Disables inference API to prevent mismatch with HF implementation. Gustavo de Rosa 2023-12-13 21:54:41 +0000
  • ca573e3fa3 fix(modeling_phi): Fixes initial generation with length larger than context length. Gustavo de Rosa 2023-12-08 17:40:16 +0000
  • 37527ba0b8 fix(modeling_phi): Fixes cached generation when above maximum context length. Gustavo de Rosa 2023-12-05 21:09:53 +0000
  • 85d94971d2 add attn_pdrop and auto_map Susnato Dhar 2023-11-25 09:47:14 +0530
  • ff4e06fd98 changes Susnato Dhar 2023-11-23 20:14:53 +0530
  • a3ae3ab4c8 Adding safetensors variant of this model Safetensors convertbot 2023-11-20 22:38:25 +0000
  • 5fd430c7bc Fixes exceeding maximum sequence length when using generate(). Gustavo de Rosa 2023-11-20 18:11:04 +0000
  • 8bff212036 Adding Evaluation Results Open LLM Leaderboard PR Bot 2023-11-17 21:16:51 +0000
  • cc42b71bc4 Adding safetensors variant of this model Safetensors convertbot 2023-11-17 19:36:48 +0000
  • d212a78962 Delete modeling_mixformer_sequential.py Gustavo de Rosa 2023-11-16 18:10:37 +0000
  • 8e9ebfb9bf Delete configuration_mixformer_sequential.py Gustavo de Rosa 2023-11-16 18:10:30 +0000
  • 271c3397ab Update to new model interface. Gustavo de Rosa 2023-11-16 17:28:06 +0000
  • 2066613d0b prototype of unblocking onnx export titaiwang 2023-11-02 21:19:36 +0000
  • 92557d03bb Improves type hinting on configuration arguments. Gustavo de Rosa 2023-11-01 23:40:19 +0000
  • 45f4b21525 Enables to toggle fused_dense, flash_rotary and attn_pdrop in the configuration. Gustavo de Rosa 2023-11-01 23:33:57 +0000
  • 0254d42a95 Fixes flash-attn import with a try/except statement Gustavo de Rosa 2023-11-01 23:32:35 +0000
  • 0bbd68a176 Adds support for flash-attn rotary embedding and fused dense layers. Gustavo de Rosa 2023-11-01 20:40:12 +0000
  • de35f900d3 Adds support for MQA/GQA and attention mask during training. Gustavo de Rosa 2023-10-30 16:59:12 +0000
  • d38e6f954e Update modeling_mixformer_sequential.py Gustavo de Rosa 2023-10-26 20:01:15 +0000
  • 8091327f9e Adding _set_gradient_checkpointing for compatibility (#22) Gustavo de Rosa 2023-10-17 12:11:30 +0000
  • 9700feb531 Bonjour je demande de prolonger mon congé pour des raisons familiales Khaled ha 2023-10-11 01:39:44 +0000
  • a30a931294 Adding _set_gradient_checkpointing for compatibility Vicente Rivera 2023-09-18 22:17:40 -0700
  • 479d9388d0 Adding safetensors variant of this model Nikita Sherstnev 2023-10-02 12:54:35 +0000
  • 9d222b61dd Upload INDIRA - MALAH DIAJAK NGOMONG PAKE BAHASA LAIN?!.wav Sato 2023-10-02 11:36:52 +0000
  • 7b2c1f745e The birds fly on the sky Serkan Bulut 2023-09-29 07:21:39 +0000
  • b6a7e2fe15 Upload modeling_mixformer_sequential.py Gustavo de Rosa 2023-09-27 15:22:44 +0000
  • 8ab0f29ff6 Add more precise license metadata (UI will be cleaner!) (#35) Gustavo de Rosa 2023-09-27 15:20:42 +0000
  • 2c182742af Add more precise license metadata (UI will be cleaner!) Julien Chaumond 2023-09-27 14:30:18 +0000
  • bc09a085e7 Upload README.md Gustavo de Rosa 2023-09-27 14:04:07 +0000
  • f9f2ac7c45 fix(phi-1_5): Checks length of attention_maskif it is passed as direct tensor. Gustavo de Rosa 2023-09-26 21:21:45 +0000
  • 3128bb636a Support for attention_mask in forward pass. Gustavo de Rosa 2023-09-26 18:17:08 +0000
  • 9b319c095b Update README.md conf 2023-09-25 02:55:52 +0000
  • cbdc810636 Adding safetensors variant of this model Safetensors convertbot 2023-09-16 02:37:05 +0000
  • 4a426d8015 add _no_split_modules property (#17) Gustavo de Rosa 2023-09-15 22:57:07 +0000
  • 7e925ddfdf add _no_split_modules property wing lian 2023-09-15 17:21:11 +0000
  • 75d0fb4c8c Adding safetensors variant of this model Safetensors convertbot 2023-09-14 01:19:16 +0000
  • 7d482ddf93 Update README.md Gunasekar 2023-09-14 00:44:40 +0000
  • 25b3eda736 Adding safetensors variant of this model Safetensors convertbot 2023-09-12 19:05:36 +0000
  • c8f6ad8189 Update README.md Gunasekar 2023-09-12 18:40:56 +0000
  • 94ea7b7e37 Adding safetensors variant of this model Safetensors convertbot 2023-09-12 16:05:39 +0000
  • 762a3110be Link paper to arXiv (#5) Gustavo de Rosa 2023-09-12 16:01:41 +0000
  • f7c1b27656 Added cuda config on Sample Code Mahimai Raja J 2023-09-12 15:23:58 +0000
  • c30653547e Link paper to arXiv Omar Sanseviero 2023-09-12 14:57:37 +0000
  • 6222f4b28f Adding safetensors variant of this model Safetensors convertbot 2023-09-12 13:32:04 +0000
  • ea95720a35 Update README.md Gunasekar 2023-09-12 01:38:42 +0000
  • 4bba51c9b5 Update README.md Gunasekar 2023-09-11 21:45:49 +0000
  • 52e294acfe Update README.md Gunasekar 2023-09-11 21:44:15 +0000
  • 9efbcafbe4 Upload tokenizer Gunasekar 2023-09-11 21:30:53 +0000
  • d655135ca1 Upload MixFormerSequentialForCausalLM Gunasekar 2023-09-11 21:30:53 +0000
  • 07a048efa7 Update README.md Gunasekar 2023-09-11 07:57:24 +0000
  • b63051536f Update README.md Gunasekar 2023-09-11 07:56:12 +0000
  • 40b496f7e0 Update README.md Gunasekar 2023-09-11 07:50:39 +0000
  • d9c7521001 Update README.md Gunasekar 2023-09-11 07:46:06 +0000
  • 6ddac37bb9 Update README.md Gunasekar 2023-09-11 07:35:26 +0000
  • cd4510ca85 Update README.md Gunasekar 2023-09-11 07:33:48 +0000
  • 34046b03b7 Update README.md Gunasekar 2023-09-11 07:32:34 +0000
  • 24ad69c3c0 Update README.md Gunasekar 2023-09-11 02:12:39 +0000
  • b3d67f3c44 Update README.md Gunasekar 2023-09-11 01:01:17 +0000
  • 14be6562c1 Upload Research License.docx Gunasekar 2023-09-11 01:00:01 +0000
  • 6157c47c1f Upload tokenizer Gunasekar 2023-09-10 06:28:52 +0000
  • e656142af4 Upload MixFormerSequentialForCausalLM Gunasekar 2023-09-10 06:28:51 +0000
  • 4b752e7b2d Upload tokenizer Gunasekar 2023-09-10 06:16:33 +0000
  • 2bfd6ef82c Upload MixFormerSequentialForCausalLM Gunasekar 2023-09-10 06:16:29 +0000
  • 67f350b99d Upload tokenizer Gunasekar 2023-09-10 06:15:56 +0000