Debug truncation of announcement content

luojiehua, 2 years ago
Parent
Current commit
73bddf26dc
1 file changed, with 1 insertion and 0 deletions
  1. BaseDataMaintenance/maintenance/dataflow_mq.py (+1, -0)

+ 1 - 0
BaseDataMaintenance/maintenance/dataflow_mq.py

@@ -756,6 +756,7 @@ class Dataflow_ActivteMQ_extract(Dataflow_extract):
             if html_len>200000:
                 # if int(item.get("docid"))==238431011:
                 #     save(item,"238431011.pk")
+                log("docid %s dochtmlcon too long len %d "%(str(item.get("docid")),html_len))
                 try:
                     _dochtmlcon = re.sub("<html>|</html>|<body>|</body>", "", _dochtmlcon)
                     _soup = BeautifulSoup(_dochtmlcon,"lxml")
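
The added line logs the docid and HTML length whenever the content exceeds the 200000-character threshold, before the wrapper tags are stripped. A minimal standalone sketch of that pattern follows. It makes several assumptions: the function name `strip_wrappers_if_too_long` is hypothetical, the standard `logging` module stands in for the repository's own `log` helper, the item key `"dochtmlcon"` is a guess based on the variable name, and the BeautifulSoup step is left out.

```python
import logging
import re

# Stand-in for the repository's own log() helper (assumption).
logger = logging.getLogger("dataflow_sketch")

MAX_HTML_LEN = 200000  # threshold taken from the hunk above


def strip_wrappers_if_too_long(item, max_len=MAX_HTML_LEN):
    """Hypothetical helper: log oversized HTML, then strip wrapper tags."""
    # The "dochtmlcon" key is an assumption based on the variable name in the diff.
    html = item.get("dochtmlcon", "")
    html_len = len(html)
    if html_len > max_len:
        # Same message shape as the line added in this commit.
        logger.info("docid %s dochtmlcon too long len %d",
                    str(item.get("docid")), html_len)
        # Same substitution as in the hunk: drop <html>/<body> wrapper tags.
        html = re.sub("<html>|</html>|<body>|</body>", "", html)
    return html
```

Passing a small `max_len` makes the branch easy to exercise without a 200000-character document, which helps when checking that the log line actually fires.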