[1] S.
Abiteboul.
Querying
Semistructured Data.
Proceedings of the
International Conference on Database Theory (ICDT),
,
January 1997.
[2] B. Adelberg.
NoDoSE - A tool for
Semi-Automatically Extracting Semistructured Data from Text
Documents.
Proceedings ACM SIGMOD
International Conference on Management of Data, Seat-
tle, June 1998.
[3] D. E. Appelt, D.
J. Israel.
Introduction to
Information Extraction Technology.
Tutorial for IJCAI-99,
, August 1999.
[4] N. Ashish, C. A.
Knoblock.
Semi-automatic Wrapper
Generation for Internet Information Sources.
Second IFCIS
Conference on Cooperative Information Systems (CoopIS),
olina, June 1997.
[5] N. Ashish, C. A.
Knoblock.
Wrapper Generation for
semistructured Internet Sources.
SIGMOD Record, Vol.
26, No. 4, pp. 8--15, December 1997.
[6] P. Atzeni, G.
Mecca.
Cut & Paste.
Proceedings of the
16‘th ACM SIGACT-SIGMOD-SIGART Symposium on Principles
of
Database Systems (PODS‘97), , May 1997.
[7] M. Bauer, D.
Dengler.
TrIAs - An
Architecture for Trainable Information Assistants.
Workshop on AI and
Information Integration, in conjunction with the 15‘th National
Conference on
Artificial Intelligence (AAAI-98), , July 1998.
[8] P. Berka.
Intelligent Systems on
the Internet.
http://lisp./
berka/ai-inet.htm, Laboratory of Intelligent Systems, University
of
Economics,
[9] L. Bright, J. R.
Gruser, L. Raschid, M. E. Vidal.
A
Wrapper Generation Toolkit to Specify and Construct Wrappers for Web
Accessible
Data Sources
(WebSources).
Computer Systems
Special Issue on Semantics on the WWW, Vol. 14 No. 2, March
1999.
[10] S. Brin.
Extracting Patterns
and Relations from the World Wide Web.
International Workshop
on the Web and Databases (WebDB‘98), , March 1998.
[11] M. E. Califf, R.
J. Mooney.
Relational Learning of
Pattern-Match Rules for Information Extraction.
Proceedings of the ACL
Workshop on Natural Language , July 1997.
[12] M. E. Califf.
Relational Learning
Techniques for Natural Language Information Extraction.
Ph.D. thesis,
Department of Computer Sciences, , August
1998. Technical Report
AI98-276.
[13] S. Chawathe, H.
Garcia-Molina, J. Hammer, K. Ireland, Y. Papakonstantinou, J.
Ullman, J. Widom.
The TSIMMIS Project:
Integration of Heterogeneous Information Sources.
In
Proceedings of IPSJ Conference, pp. 7--18, , Japan,
October 1994.
[14] B. Chidlovskii,
U. M. Borghoff, P-Y. Chevalier.
Towards Sophisticated
Wrapping of Web-based Information Repositories.
Proceedings of the
5‘th International RIAO Conference, , June 1997.
[15] M. Craven, D.
DiPasquo, D. Freitag, A. McCallum, T. Mitchell, K. Nigam, S.
Slattery.
Learning to Extract
Symbolic Knowledge from the World Wide Web.
Proceedings of the
15‘th National Conference on Artificial Intelligence (AAAI-98),
, , July 1998.
[16] M. Craven, S.
Slattery, K. Nigam.
First-Order Learning
for Web Mining.
Proceedings of the
10‘th European Conference on Machine , April
1998.
[17] R. B. Doorenbos,
O. Etzioni, D. S. Weld.
A
Scalable Comparison-Shopping Agent for the World Wide Web.
Technical report
UW-CSE-,
, 1996.
[18] R. B. Doorenbos,
O. Etzioni, D. S. Weld.
A
Scalable Comparison-Shopping Agent for the
World-Wide-Web.
Proceedings of the
first International Conference on Autonomous Agents, ,
February 1997.
[19] O. Etzioni
Moving up the
Information Food Chain: Deploying Softbots on the World Wide Web.
AI
Magazine, 18(2):11-18, 1997.
[20] D. Florescu, A.
Levy, A. Mendelzon.
Database Techniques
for the World Wide Web: A Survey.
ACM SIGMOD Record,
Vol. 27, No. 3, September 1998.
[21] D. Freitag.
Information Extraction
from HTML: Application of a General Machine Learning Ap-
proach.
Proceedings of the
15‘th National Conference on Artificial Intelligence (AAAI-98),
, , July 1998.
[22] D. Freitag.
Machine Learning for
Information Extraction in Informal Domains.
Ph.D. dissertation,
, November 1998.
[23] D. Freitag.
Multistrategy Learning
for Information Extraction.
Proceedings of the
15‘th International Conference on Machine Learning (ICML-98),
, , July 1998.
[24] R. Gaizauskas, Y.
Wilks.
Information
Extraction: Beyond Document Retrieval.
Computational
Linguistics and Chinese Language Processing, vol. 3, no. 2, pp.
17--60,
August 1998,
[25] H. Garcia-Molina,
J. Hammer, K. Ireland, Y. Papakonstantinou, J. Ullman, J.
Widom.
Integrating and
Accessing Heterogeneous Information Sources in TSIMMIS.
In
Proceedings of the AAAI Symposium on Information Gathering, pp.
61--64, Stan-
ford, ,
March 1995.
[26] S. Grumbach and
G. Mecca.
In
Search of the Lost Schema.
Proceedings of the
International Conference on Database Theory (ICDT‘99),
, January 1999.
[27] J-R. Gruser, L.
Raschid, M. E. Vidal, L. Bright.
Wrapper Generation for
Web Accessible Data Source.
Proceedings of the
3‘rd IFCIS International Conference on Cooperative Information
Systems (CoopIS-98),
New
York, August 1998.
[28] J. Hammer, H.
Garcia-Molina, J. Cho, R. Aranha, A. Crespo.
Extracting
Semistructured Information from Web.
Proceedings of the
Workshop on Management of Semistructured Data, , Ari-
zona, May 1997.
[29] J. Hammer, H.
Garcia-Molina, S. Nestorov, R. Yerneni, M. Breunig, V. Vassalos.
Template-Based
Wrappers in the TSIMMIS System.
Proceedings of the
26‘th SIGMOD International Conference on Management of Data,
, , May 1997.
[30] C-H. Hsu.
Initial Results on
Wrapping Semistructured Web Pages with Finite-State Transducers
and Contextual Rules.
Workshop on AI and
Information Integration, in conjunction with the 15‘th National
Conference on
Artificial Intelligence (AAAI-98), , July 1998.
[31] C-H. Hsu and M-T
Dung.
Generating Finite-Sate
Transducers for semistructured Data Extraction From the
Web.
Information systems,
Vol 23. No. 8, pp. 521--538, 1998.
[32] C. A. Knoblock,
S. Minton, J. L. Ambite, N. Ashish, P. J. Modi, I. Muslea, A. G.
Philpot, S. Tejada.
Modeling Web Sources
for Information Integration.
Proceedings of the
15‘th National Conference on Artificial Intelligence (AAAI-98),
, , July 1998.
[33] N. Kushmerick, D.
S. Weld, R. Doorenbos.
Wrapper Induction for
Information Extraction.
15‘th International
Joint Conference on Artificial Intelligence (IJCAI-97), ,
August 1997.
[34] N. Kushmerick.
Wrapper Induction for
Information Extraction.
Ph.D. Dissertation,
. Technical Report
UW-CSE-,
1997.
[35] N. Kushmerick.
Wrapper induction:
Efficiency and expressiveness.
Workshop on AI and
Information Integration, in conjunction with the 15‘th National
Conference on
Artificial Intelligence (AAAI-98), , July 1998.
[36] Kushmerick, N.
Gleaning the Web.
IEEE Intelligent
Systems, 14(2), March/April 1999.
[37] S. Lawrence, C.l.
Giles.
Searching the World
Wide Web.
Science magazine, v.
280, pp. 98--100, April 1998.
[38] A. Y. Levy, A.
Rajaraman, J. J. Ordille.
Querying
Hetereogeneous Information Sources Using Source Descriptions.
Proceedings 22‘nd VLDB
Conference, , September 1996.
[39] S. Muggleton, C.
Feng.
Efficient Induction of
Logic Programs.
Proceedings of the
First Conference on Algorithmic Learning Theory, ,
1990.
[40]
Extraction Patterns:
From Information Extraction to Wrapper Induction.
Information Sciences
Institute, , 1998.
[41]
Extraction Patterns
for Information Extraction Tasks: A Survey.
Workshop on Machine
Learning for Information Extraction, , July 1999.
[42] Muslea, S. Minton, C. Knoblock.
STALKER: Learning
Extraction Rules for Semistructured, Web-based Information
Sources.
Workshop on AI and
Information Integration, in conjunction with the 15‘th National
Conference on
Artificial Intelligence (AAAI-98), , July 1998.
[43] Muslea, S. Minton, C. Knoblock.
Wrapper Induction for
Semistructured Web-based Information Sources.
Proceedings of the
Conference on Automatic Learning and Discovery CONALD-98,
, June 1998.
[44] Muslea, S. Minton, C. Knoblock.
A
Hierarchical Approach to Wrapper Induction.
Third International
Conference on Autonomous Agents, (Agents‘99), Seattle, May
1999.
[45] S. Nestorov, S.
Aboteboul, R. Motwani.
Inferring Structure in
Semistructured Data.
Proceedings of the
13‘th International Conference on Data Engineering (ICDE‘97),
, , April
1997.
[46] STS Prasad, A.
Rajaraman.
Virtual Database
Technology, XML, and the Evolution of the Web.
Data Engineering, Vol.
21, No. 2, June 1998.
[47] J.R. Quinlan, R.
M. Cameron-Jones.
FOIL: A Midterm
Report.
European Conference on
Machine Learning, , 1993.
[48] A. Rajaraman.
Transforming the
Internet into a Database.
Workshop on Reuse of
Web information, in conjunction with WWW7, Brisbane, April
1998.
[49] A. Sahuguet, F.
Azavant.
WysiWyg Web Wrapper
Factory (W
http://cheops.cis./
sahuguet/WAPI/wapi.ps.gz,
nia, August 1998.
[50] D. Smith, M.
Lopez.
Information Extraction
for Semistructured Documents.
Proceedings of the
Workshop on Management of Semistructured Data, in conjunction
with PODS/SIGMOD,
,
, May 1997.
[51] S. Soderland.
Learning to Extract
Text-based Information from the World Wide Web.
Proceedings of the
3‘rd International Conference on Knowledge Discovery and Data
Mining (KDD),
, August 1997.
[52] S. Soderland.
Learning Information
Extraction Rules for Semistructured and Free Text.
Machine Learning,
1999.
[53] K. Zechner.
A
Literature Survey on Information Extraction and Text Summarization.
Term paper, , 1997.
[54] About mySimon.
http://www./about
mysimon/company/backgrounder.anml
|