fangjiasheng

fangjiasheng pushed to master at luojiehua/BIDI_ML_INFO_EXTRACTION

1 month ago

fangjiasheng pushed to master at fangjiasheng/FORMAT_CONVERSION_MAXCOMPUTE

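  • ab202ff1fc 1. Add WPS file type 2. Add OFD file type 3. Add recognition of two-column borderless tables 4. Fix OCR exhausting GPU memory 5. Optimize PDF processing speed 6. Handle special Kangxi font characters 7. Add monitoring of average processing time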

1 month ago

fangjiasheng pushed to master at luojiehua/BIDI_ML_INFO_EXTRACTION

5 months ago

fangjiasheng pushed to master at fangjiasheng/FORMAT_CONVERSION_MAXCOMPUTE

  • ef08b56c48 Attachment recognition: preserve merged cells in tables

6 months ago

fangjiasheng pushed to master at luojiehua/BaseDataMaintenance

6 months ago

fangjiasheng pushed to master at luojiehua/BaseDataMaintenance

6 months ago

fangjiasheng pushed to master at fangjiasheng/FORMAT_CONVERSION_MAXCOMPUTE

7 months ago

fangjiasheng pushed to master at luojiehua/BaseDataMaintenance

7 months ago

fangjiasheng pushed to master at lishimin/VerificationCode

  • 0917978c59 1. Add CAPTCHA type detection 2. Optimize handling of each CAPTCHA type 3. Ignore dev

7 months ago

fangjiasheng pushed to master at luojiehua/BIDI_ML_INFO_EXTRACTION

8 months ago

fangjiasheng pushed to master at luojiehua/BIDI_ML_INFO_EXTRACTION

8 months ago

fangjiasheng pushed to master at luojiehua/BIDI_ML_INFO_EXTRACTION

  • 4892d76f98 1. Extract tender content type 2. Extract project type 3. Store the fields in pb

8 months ago

fangjiasheng pushed to master at fangjiasheng/FORMAT_CONVERSION_MAXCOMPUTE

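  • b83f835428 1. PDF: remove text watermarks 2. PDF: handle nested text 3. PDF: split text along table lines 4. PDF: optimize table-line post-processing 5. PDF: improve garbled-text detection 6. PDF: optimize table joining 7. Still return the other results when image recognition fails 8. Adjust image split ratio 9. When reading images, detect transparent regions and convert them to white 10. Extract text from doc and docx with tika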

9 months ago

fangjiasheng pushed to master at fangjiasheng/FORMAT_CONVERSION_MAXCOMPUTE

9 months ago

fangjiasheng pushed to master at luojiehua/BIDI_ML_INFO_EXTRACTION

11 months ago

fangjiasheng pushed to master at luojiehua/BIDI_ML_INFO_EXTRACTION

11 months ago

fangjiasheng pushed to master at luojiehua/BIDI_ML_INFO_EXTRACTION

1 year ago

fangjiasheng pushed to master at luojiehua/BIDI_ML_INFO_EXTRACTION

  • 61c0b0ab27 1. Add two fields for proposed and under-construction projects, used for building indexes 2. Remove empty values from pb in extract_json

1 year ago

fangjiasheng pushed to master at luojiehua/BIDI_ML_INFO_EXTRACTION

1 year ago