Document Processing Using Machine Learning
暫譯: 使用機器學習的文件處理

Obaidullah, Sk MD, Santosh, Kc, Goncalves, Teresa

  • 出版商: CRC
  • 出版日期: 2019-12-02
  • 售價: $5,870
  • 貴賓價: 9.5$5,577
  • 語言: 英文
  • 頁數: 182
  • 裝訂: Hardcover - also called cloth, retail trade, or trade
  • ISBN: 036721847X
  • ISBN-13: 9780367218478
  • 相關分類: Machine Learning
  • 海外代購書籍(需單獨結帳)

相關主題

商品描述

The book aims at presenting a handful of resources for students and researchers, who are working in the document image analysis (DIA) domain using machine learning since it covers multiple document processing problems. Starting with the explanation of how Artificial Intelligence (AI) plays an important role in this domain, the book further discusses how different machine learning algorithms can be applied for classification/recognition and clustering problems regardless the use of input data: image or simply texts.

In brief, the book offers comprehensive coverage of the most essential topics, including:

  • Role of AI for document image analysis
  • Optical character recognition
  • Machine learning algorithms for document analysis
  • Extreme learning machine and its applications (document understanding)
  • Mathematical foundation for Web text document analysis
  • Social media data analysis
  • Modalities for document dataset generation.

This book serves to undergraduate and graduate scholars in Computer Science/Information Technology/Electrical and Computer Engineering. Further, it is a great fit for early career research scientists and industrialists in the domain.

商品描述(中文翻譯)

本書旨在為從事文件影像分析(Document Image Analysis, DIA)領域的學生和研究人員提供一系列資源,因為它涵蓋了多種文件處理問題。書中首先解釋了人工智慧(Artificial Intelligence, AI)在此領域的重要角色,接著討論了不同的機器學習演算法如何應用於分類/識別和聚類問題,無論輸入數據是影像還是純文本。

簡而言之,本書全面涵蓋了最重要的主題,包括:
- AI 在文件影像分析中的角色
- 光學字符識別(Optical Character Recognition, OCR)
- 用於文件分析的機器學習演算法
- 極限學習機(Extreme Learning Machine)及其應用(文件理解)
- 網頁文本文件分析的數學基礎
- 社交媒體數據分析
- 文件數據集生成的模式

本書適合計算機科學、資訊科技及電機與計算機工程的本科生和研究生。此外,它也非常適合在該領域的早期職業研究科學家和業界人士。

作者簡介

Sk Md Obaidullah has completed Ph.D(Engg.) from Jadavpur University, M.Tech in Computer Sc. & Application from University of Calcutta and B.E in Computer Sc. & Engineering from Vidyasagar University in the year 2017, 2009, 2004 respectively. He was Erasmus Post-Doctoral fellow funded by European Commission at University of Evora, Portugal from Nov. 2017 to Sept. 2018. He has more than eleven years of professional experience including two years in industry and nine years in academia out of which five years of research. Presently he is working as an Assistant Professor in the Dept. of Computer Science & Engineering, Aliah University, Kolkata. He has published more than 60 research articles in renowned journals and reputed national/international conferences. He is an active researcher in the field of Document Image Processing, Medical Image Analysis, Pattern Recognition, Machine Learning.

K.C. Santosh (Senior Member, IEEE) is an Assistant Professor and Graduate Program Director for the department of computer science at University of South Dakota (USD). Also, Dr. Santosh serves the School of Computing and IT, Taylor's University as a Visiting Associate Professor. Before joining USD, Dr. Santosh worked as a research fellow at the U.S. National Library of Medicine (NLM), National Institutes of Health (NIH). He worked as a postdoctoral research scientist at the LORIA research centre, Universite de Lorraine in direct collaboration with industrial partner ITESOFT, France. He also worked as a research scientist at the INRIA Nancy Grand Est research centre, France, where, he completed his PhD diploma in Computer Science. Before that, he worked as a graduate research scholar at SIIT, Thammasat University, Thailand. He published more than 120 peer-reviewed research articles; 2 authored books (Springer); and edited 10 books, journal issues, and conference proceedings. Dr. Santosh serves as an associate editor for the International Journal of Machine Learning & Cybernetics. Dr. Santosh demonstrated expertise in artificial intelligence, machine learning, pattern recognition, computer vision, image processing, data mining, and big data with various application domains, such as healthcare and medical imaging, document information content exploitation, biometrics, forensics, speech/audio analysis, satellite imaging, robotics, and Internet of Things.

Teresa Gonçalves is an assistant professor in the Department of Informatics at the University of Évora, Portugal. She has a PhD degree in Informatics from the University of Évora since 2008, having the 5 years degree and master in Informatics Engineering, both from Faculty of Sciences and Tecnology, New University of Lisbon in 1992 and 1996, respectively. She has published more than 60 research papers in reputed journal and conferences and worked as an organizing and programme committee chair of various international conferences. She worked as PI for different research and mobility projects funded by Portugal government and European commission. Her research interests include machine learning and data mining, namely with textual data and images, recommendation systems, and evolutionary algorithms.She is responsible for several courses of undergraduate, masters and doctorate level in Computer Science. Having successfully supervised two doctorate and six master students, currently she supervises six PhD and six master students mainly on applying and adapting machine learning approaches to text or image related problems.

Dr. Nibaran Das is an Associate Professor of the Department of Computer Science and Engineering at Jadavpur University. Before joining the Jadavpur University, from 2005 to 2006, Dr. Das worked as a lecturer in Techno India, Saltlake. He worked as a postdoctoral research scientist at the University of Evora for six months in between 2012-14. He also worked as a research intern at the Competence Center Multimedia Analysis and Data Mining (MADM) at the DFKI, University of Kaiserslautern, Germany in the year 2007. Dr. Das serves as an associate editor for the journal Sadhana: Academy Proceedings in Engineering Sciences. Dr. Das has demonstrated expertise in Deep Learning, pattern recognition; image processing and machine learning with various applications in handwriting recognition, especially character recognition, medical image analysis. Dr. Das published more than 125 research articles, including the books of Handbook of Research on Recent Developments in Intelligent Communication Application, IGI global and co-authoring several conference proceedings. He guided more than 30 master degree students in his department. Dr. Das Dr. Das served as a chairperson of the young professional affinity group, IEEE Kolkata section from 2014-2015. He is the founder editor of Bangla monthly computer magazine "Computer Jagat". He is a regular reviewer for high-quality journals (IEEE, Springer, and Elsevier) and high-quality conferences and workshops (sponsored by IEEE and Springer) in the domain.

Prof. Kaushik Roy has completed B.E in Computer Science & Engineering from NIT Silchar, M.E and PhD(Engg.) in Computer Science & Engg. from Jadavpur University in the year 1998, 2002 and 2008 respectively. He has worked as a project linked personnel in ISI-Kolkata and as a Scientific Officer in CDAC-Kolkata. He has also worked as an Assistant Professor in Maulana Abul Kalam Azad University of Technology, India formerly known as West Bengal University of Technology. He is currently working as a Professor and Head of the Department of Computer Science, West Bengal State University, Barasat, India. In 2004 he has received Young IT Professional award from Computer Society of India. He has published more than 150 research papers/book chapters in reputed conferences and journals. His research interest includes pattern recognition, document image processing, medical image analysis, online handwriting recognition, speech recognition and audio signal processing. He is Life Member of IUPRAI (an unit of IAPR) and Computer Society of India.

作者簡介(中文翻譯)

Sk Md Obaidullah 於 2017 年自 Jadavpur University 獲得工程博士學位,2009 年自 Calcutta University 獲得計算機科學與應用碩士學位,以及 2004 年自 Vidyasagar University 獲得計算機科學與工程學士學位。他曾於 2017 年 11 月至 2018 年 9 月期間,擔任歐洲委員會資助的 Erasmus 博士後研究員,於葡萄牙的 Evora 大學工作。他擁有超過十一年的專業經驗,其中包括兩年的產業經驗和九年的學術經驗,其中五年專注於研究。目前,他在 Kolkata 的 Aliah University 計算機科學與工程系擔任助理教授。他在知名期刊和國內外會議上發表了超過 60 篇研究文章。他在文檔影像處理、醫學影像分析、模式識別和機器學習等領域積極從事研究。

K.C. Santosh(IEEE 高級會員)是南達科他州大學(USD)計算機科學系的助理教授及研究生項目主任。此外,Santosh 博士還擔任 Taylor's University 計算與資訊學院的訪問副教授。在加入 USD 之前,Santosh 博士曾在美國國立醫學圖書館(NLM)和國立衛生研究院(NIH)擔任研究員。他在 LORIA 研究中心擔任博士後研究科學家,與法國的工業夥伴 ITESOFT 直接合作。他還曾在法國的 INRIA Nancy Grand Est 研究中心擔任研究科學家,並在那裡完成了計算機科學的博士學位。在此之前,他在泰國的 Thammasat University 擔任研究生研究學者。他發表了超過 120 篇經過同行評審的研究文章;出版了 2 本專著(Springer);並編輯了 10 本書籍、期刊專刊和會議論文集。Santosh 博士擔任《國際機器學習與網絡科學期刊》的副編輯。他在人工智慧、機器學習、模式識別、計算機視覺、影像處理、資料挖掘和大數據等領域展現了專業知識,並應用於醫療保健和醫學影像、文檔資訊內容開發、生物識別、法醫學、語音/音頻分析、衛星影像、機器人技術和物聯網等多個應用領域。

Teresa Gonçalves 是葡萄牙 Évora 大學資訊系的助理教授。她於 2008 年獲得 Évora 大學的資訊學博士學位,並於 1992 年和 1996 年分別獲得里斯本新大學科學與技術學院的資訊工程學士和碩士學位。她在知名期刊和會議上發表了超過 60 篇研究論文,並擔任多個國際會議的組織和程序委員會主席。她曾擔任多個由葡萄牙政府和歐洲委員會資助的研究和流動性項目的主要研究者。她的研究興趣包括機器學習和資料挖掘,特別是文本數據和影像、推薦系統和進化演算法。她負責多個本科、碩士和博士層級的計算機科學課程。她成功指導了兩位博士生和六位碩士生,目前主要指導六位博士生和六位碩士生,專注於將機器學習方法應用於文本或影像相關問題。

Dr. Nibaran Das 是 Jadavpur University 計算機科學與工程系的副教授。在加入 Jadavpur University 之前,Dr. Das 在 2005 年至 2006 年期間擔任 Techno India, Saltlake 的講師。他在 2012 至 2014 年期間於 Evora 大學擔任六個月的博士後研究科學家。他還於 2007 年在德國凱瑟斯勞滕大學的多媒體分析與資料挖掘(MADM)能力中心擔任研究實習生。Dr. Das 擔任期刊 Sadhana: Academy Proceedings in Engineering Sciences 的副編輯。Dr. Das 在深度學習、模式識別、影像處理和機器學習方面展現了專業知識,並在手寫識別(特別是字符識別)、醫學影像分析等多個應用領域中發揮作用。Dr. Das 發表了超過 125 篇研究文章,包括《智能通信應用近期發展研究手冊》(IGI global)和多篇會議論文的共同作者。他在其系指導了超過 30 位碩士生。Dr. Das 曾於 2014 至 2015 年擔任 IEEE Kolkata 區域年輕專業人員親和小組的主席。他是孟加拉月刊計算機雜誌《Computer Jagat》的創始編輯。他是高品質期刊(IEEE、Springer 和 Elsevier)及高品質會議和研討會(由 IEEE 和 Springer 贊助)的定期審稿人。

Prof. Kaushik Roy 於 1998 年自 NIT Silchar 獲得計算機科學與工程學士學位,並於 2002 年和 2008 年分別自 Jadavpur University 獲得計算機科學與工程碩士和博士學位。他曾在 ISI-Kolkata 擔任項目聯繫人員,並在 CDAC-Kolkata 擔任科學官。他還曾在印度的 Maulana Abul Kalam Azad University of Technology(前稱西孟加拉技術大學)擔任助理教授。目前,他在印度 Barasat 的西孟加拉州立大學擔任教授及計算機科學系主任。2004 年,他獲得印度計算機學會的年輕 IT 專業人員獎。他在知名會議和期刊上發表了超過 150 篇研究論文/書籍章節。他的研究興趣包括模式識別、文檔影像處理、醫學影像分析、在線手寫識別、語音識別和音頻信號處理。他是 IUPRAI(IAPR 的一個單位)和印度計算機學會的終身會員。

最後瀏覽商品 (1)