Zenabius committed on
Commit fd44178 · verified · 1 Parent(s): 6b68337

Update README.md

Files changed (1)
  1. README.md +21 -17
README.md CHANGED
@@ -1,17 +1,21 @@
- # ONNX Model
-
- Converted from: granite-embedding-reranker-english-r2
-
- ## Files
- - model.onnx - FP32 version
- - model_quantized.onnx - INT8 quantized version
- - *.json - tokenizer and config files
-
- ## Usage
- ```python
- from transformers import AutoTokenizer
- import onnxruntime as ort
-
- tokenizer = AutoTokenizer.from_pretrained("granite-onnx")
- session = ort.InferenceSession("granite-onnx/model_quantized.onnx")
- ```
+ ---
+ base_model:
+ - ibm-granite/granite-embedding-reranker-english-r2
+ ---
+ # ONNX Model
+
+ Converted from: granite-embedding-reranker-english-r2
+
+ ## Files
+ - model.onnx - FP32 version
+ - model_quantized.onnx - INT8 quantized version
+ - *.json - tokenizer and config files
+
+ ## Usage
+ ```python
+ from transformers import AutoTokenizer
+ import onnxruntime as ort
+
+ tokenizer = AutoTokenizer.from_pretrained("granite-onnx")
+ session = ort.InferenceSession("granite-onnx/model_quantized.onnx")
+ ```
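
The Usage snippet in the README stops once the tokenizer and session are loaded. As a rough sketch of a possible next step, the hypothetical helper below scores query/document pairs with those two objects. The name `rerank` is not part of the repository. The sketch also assumes that the exported graph takes int64 inputs and that its first output holds one relevance logit per pair; neither is confirmed by the README, so check both against the actual model.

```python
import numpy as np


def rerank(query, documents, tokenizer, session):
    """Hypothetical helper: score (query, document) pairs, best match first."""
    if not documents:
        return []
    # A cross-encoder reads the query and the document together,
    # so each pair is tokenized as one sequence.
    encoded = tokenizer(
        [query] * len(documents),
        list(documents),
        padding=True,
        truncation=True,
        return_tensors="np",
    )
    # Feed only the inputs the ONNX graph declares; some exports
    # leave out token_type_ids, for example.
    expected = {graph_input.name for graph_input in session.get_inputs()}
    feeds = {
        name: np.asarray(values, dtype=np.int64)  # assumption: int64 inputs
        for name, values in encoded.items()
        if name in expected
    }
    # Assumption: the first output holds one relevance logit per pair.
    first_output = np.asarray(session.run(None, feeds)[0])
    logits = first_output.reshape(len(documents), -1)[:, 0]
    order = np.argsort(-logits)  # highest logit first
    return [(documents[i], float(logits[i])) for i in order]
```

With the `tokenizer` and `session` from the Usage snippet, `rerank("some query", ["first passage", "second passage"], tokenizer, session)` would return the passages paired with their raw logits, ordered from most to least relevant.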