DBLP URL

(not all are mine): http://www.informatik.uni-trier.de/~ley/db/indices/a-tree/f/Fan:Wei.html

Google Scholar

my google scholar and using an old name

Selected Publications

Ph.D. Thesis/Book

[2001] "Cost-sensitive, Scalable and Adaptive Learning using Ensemble-based Methods", supervised by Professor Salvatore J. Stolfo. Applications are in fraud-detection and intrusion detection systems. Now available as published book from either Amazon or Morebooks

Major Conferences

[2011]    Jing Peng, Costin Barbu, Guna Seetharaman, Wei Fan, Xian Wu and Kannappan Palaniappan, "ShareBoost: Boosting for Multi-View Learning with Performance Guarantees", 2011 European Conference on Machine Learning and Principle and Practice of Knowledge Discovery in Databases (ECML PKDD'2011), Athens, Greece, September, 2011

[2011]    Xiangnan Kong, Wei Fan and Philip Yu, "Dual Active Feature and Sample Selection for Graph Classification", 2011 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'11), San Diego, USA

[2011]    Xiaoxiao Shi, Wei Fan, Jianping Zhang, and Philip Yu, "Discoverying Shaker from Evolving Entities via Cascading Graph Inference", 2011 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'11), San Diego, USA

[2011]    Rita Chattpadhyay, Jieping Ye, Sethuraman Panchanathan, Wei Fan, and Ian Davidson, "Multi-Source Domain Adaptation and Its Application to Early Detection of Fatigue", 2011 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'11), San Diego, USA

[2011]    Jing Gao, Wei Fan, Deepak S. Turaga, Olivier Verscheure, Xiaoqiao Meng, Lu Su, and Jiawei Han, "Consensus Extraction from Heterogeneous Detectors to Improve Performance over Network Traffic Anomaly Detection" , 30th IEEE International Conferece on Computer Communications, Joint Conference of the IEEE Computer and Communications Societies (INFOCOM'2011), April 2011, Shanghai, China. The Powerpoint presentation is here

[2011]    Jing Peng, Guna Seetharaman, Wei Fan, Stefan Rubila, and Aparna Varde, Chernoff Dimensionality Reduction - Where Fisher Meets FKT", 2011 SIAM International Conference on Data Mining (SDM'11), Phoenix, Arizona.

[2010]    Xiaoxiao Shi, Qi Liu, Wei Fan, Philip Yu, and Ruixin Zhu, "Transfer Learning on Heterogenous Feature Spaces via Spectral Transformation", 2010 IEEE International Conference on Data Mining (ICDM'10), Sydney, Australia, December 2010. The Powerpoint presentation is here.

[2010]   Xiaoxiao Shi, Wei Fan, and Philip Yu, "Efficient Semi-supervised Spectral Co-clustering with Constraints", 2010 IEEE International Conference on Data Mining (ICDM'10), Sydney, Australia, December, 2010. The Powerpoint presentation is here

[2010]    Jing Peng, Stefan A. Robila, Wei Fan, and Guna Seetharaman, "Analysis of Chernoff Criterion for Linear Dimensionality Reduction", 2010 IEEE International Conference on Systems, Man and Cybernetics (SMC'2010), Istanbul, Turkey, October 2010.

[2010]   Wei Fan, and Xiaoming Li, "Data Mining and Modeling Challenges for Internet of Things (IoT)" (in Chinese), Communications of Chinese Computer Society, September, 2010. Vol 6:9

[2010]   Sihong Xie, Wei Fan, Olivier Verscheure, and Jiangtao Ren, "Efficient and Numerically Stable Sparse Learning", 2010 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databaes (ECML/PKDD'2010), Barcelona, Spain. [pp] [code]

[2010]   Erheng Zhong, Wei Fan, Qiang Yang, Olivier Verscheure, and Jiangtao Ren, "Cross Validation Framework to Choose Amongst Models and Datasets for Transfer Learning", 2010 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD'2010), Barcelona, Spain. The Powerpoint presentation is here

[2010]   Jing Gao, Feng Liang, Wei Fan, Chi Wang, Yizhou Sun, and Jiawei Han, "On Community Outliers and their Efficient Detection in Information Networks , 2010 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, 2010. The Powerpoint presentation is here

[2010]   Hillol Kargupta, Joao Gama, and Wei Fan, "The Next Generation of Transportation Systems, Greenhouse Emissions, and Data Mining", 2010 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (Panel Summary), Washington, DC, USA, 2010.

[2010]   Xia Tian Zhang, Quan Yuan, Shiwan Zhao, Wei Fan, Wen Tao Zheng, and Dong Wang, "Multi-label Learning without Multi-label Cost", 2010 SIAM International Conference on Data Mining, Columbus, OH, April, 2010. The Powerpoint presentation can be found here. The open source code on RDT used in this paper can be found www.dice4dm.com

[2010]    Xiaoxiao Shi, Qi Liu, Wei Fan, Qiang Yang, and Philip Yu, "Predictive Modeling with Heterogeneous Sources", 2010 SIAM International Conference on Data Mining, Columbus, OH, April, 2010. The Powerpoint presentation can be found here, and the code can be downloaded from here.

[2010]    Wei Fan, Erheng Zhong, Jing Peng, Olivier Verscheure, Kun Zhang, Jiangtao Ren, Rong Yan, and Qiang Yang, "Generalized and Heuristic-Free Feature Construction", 2010 SIAM International Conference on Data Mining, Columbus, OH, April, 2010. The Powerpoint presentation can be found here. The code and dataset can be found at here.

[2009]   Jing Gao, Feng Liang, Wei Fan, Yizhou Sun, and Jiawei Han, "Graph-based Consensus Maximization among Multiple Supervised and Unsupervised Models", 23rd Annual Conference on Neural Information Processing Systems (NIPS'09), Vancouver, Canada, Dcember 2009. The Powerpoint presentation can be found here.

[2009]   Bo Wang, Jie Tang, Wei Fan, Songcan Chen, Zi Yang, Yanzhu Liu, "Heterogeneous Cross Domain Ranking in Latent Space", Eighteenth ACM conference on Information and Knowledge Management (CIKM'10), Hong Kong, November, 2010. The Powerpoint presentation can be found here.

[2009]   Erheng Zhong, Wei Fan, Jing Peng, Olivier Verscheure, and Jiangtao Ren, "Universal Learning over Related Distributions and Adaptive Graph Transduction", The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2009) September 7-11, 2009, Bled, Slovenia. The Powerpoint presentation can be found here . The code and datasets can be found here .

[2009]   Xiaoxiao Shi, Wei Fan, Qiang Yang and Jiangtao Ren, "Relaxed Transfer of Different Classes via Spectral Partition", The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2009) September 7-11, 2009, Bled, Slovenia. The Powerpoint presentation can be found here

[2009]   Erheng Zhong, Wei Fan, Jing Peng, Kun Zhang, Jiangtao Ren, and Olivier Verscheure, "Cross Domain Distribution Adaptation via Kernel Mapping", 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09). The Powerpoint presentation can be found here, the source code and dataset can be found here.

[2009]   Jing Gao, Wei Fan, Yizhou Sun, and Jiawei Han, "Heterogeneous Source Consensus Learning via Decision Propagation and Negotiation", 15th SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09). The Powerpoint presentation can be found here, more information about the work can be found here.

[2009]   Sihong Xie, Wei Fan, Jing Peng, Olivier Verscheure, and Jiangtao Ren, "Latent Space Domain Transfer between High Dimensional Overlapping Distributions" , 2009 18th International World Wide Web Conference (WWW'09), Madrod, Spain. The software and data used in this paper can be downloaded here. The Powerpoint presentation can be found here.

[2008]   Erheng Zhong, Sihong Xie, Wei Fan, Jiangtao Ren, Jing Peng, and Kun Zhang, "Graph-based Iterative Hybrid Feature Selection", 2008 IEEE International Conference on Data Mining (ICDM'08), Pisa, Italy. The software and data used in this paper can be downloaded here. The Powerpoint presentation can be found here.

[2008]   Xiaoxiao Shi, Wei Fan, and Jiangtao Ren, "Actively Transfer Domain Knowledge", 2008 European Confernce on Machine Learning and Principles and Practices of Knowledge Discovery in Databases (ECML/PKDD08), Antwerp, Belgium. The Powerpoint presentation is here. The code and dataset (synthetic and landmine) written by Xiaoxiao Shi can be found at here, and the 20 Newsgroup data can be found at here.

[2008]   Wei Fan, Kun Zhang, Hong Cheng, Jing Gao, Jiawei Han, Philip Yu, and Olivier Verscheure, "Direct Mining of Discriminative and Essential Frequent Patterns via Model-based Search Tree", The Powerpoint presentation, and poster, 2008 ACMKDD International Conference on Knowledge Discovery and Data Mining (KDD'08), Las Vegas, NE, USA. The software package for Linux environment can be founded here.

[2008]   Jing Gao, Wei Fan, Jing Jian, and Jiawei Han, "Knowledge Transfer via Multiple Model Local Structure Mapping", 2008 ACMKDD International Conference on Knowledge Discovery and Data Mining (KDD'08), the conference Powerpoint presentation, the longer presentation, and poster, Las Vegas, NE, USA. The software and code package written by Jing Gao can be found here.

[2008]   Jiangtao Ren, Xiaoxiao Shi, Wei Fan, and Philip S. Yu "Type Independent Correction of Sample Selection Bias via Structural Discovery and Re-balancing", Two page PowerPoint Summary, and Long Presentation as well as DataSet and Code, 2008 SIAM International Conference on Data Mining (SDM'08), Atlanta, GA, Apr 2008.

[2008]   Jiangtao Ren, Zhengyuan Qiu, Wei Fan, Hong Cheng, and Philip S. Yu "Forward Semi-supervised Feature Selection" , Powerpoint presentation, the dataset and software written by Zhengyuan Qiu, 2008 Pacific-Asian Conference on Knowledge Discovery and Data Mining (PAKDD'08), Osaka, Japan, April 2008.

[2007]   Jing Gao, Wei Fan, and Jiawei Han "On Appropriate Assumptions to Mine Data Streams: Analysis and Practice", Presentation PowerPoint. 2007 IEEE International Conference on Data Mining (ICDM'07), Omaha, NE, Oct 2007.

[2007]   Kun-Lung Wu, Philip S. Yu, Bugra Gedik, Kirsten Hildrum, Charu Aggarwal, Eric Bouillet, Wei Fan, David George, Xiaohui Gu, Gang Luo, Haixun Wang "Challenges and Experiences in Prototyping a Multi-Modal Stream Analytic and Monitoring Application on System S", 2007 Very Large DataBase Confrence (VLDB'07), Vienna, Austria, Sep 2007 (Industry Track).

[2007]   Wei Fan, and Ian Davidson "On Sample Selection Bias and Its Efficient Correction via Model Averaging and Unlabeled Examples," 2007 SIAM International Conference on Data Mining (SDM'07), Minneapolis, MN, April 2007. Click here for PowerPoint presentation (many animations).

[2007]   Jing Gao, Wei Fan, Jiawei Han, and Philip S. Yu, "A General Framework for Mining Concept-Drifting Streams with Skewed Distribution", Presentation PowerPoint 2007 SIAM International Conference on Data Mining (SDM'07), Minneapolis, MN, April 2007.

[2006]   Kun Zhang, Wei Fan, Xiaojing Yuan, Ian Davidson, and Xiangshang Li, "Forecasting Skewed Stochastic Biased Ozone Days: Analyses and Solutions" , 2006 IEEE International Conference on Data Mining (ICDM'2006), December 2006. Best Paper Award: Application Category . Presentation PowerPoint (lots of animations).

         An extended journal version of this paper comparing RDT (random decision trees) with SVM, AdaBoosting and a number of other popular learning algorithms for this streaming problem can be found here .

         The ozone dataset is available for research use by sending a request to either Dr. Kun Zhang (zhang.kun05@gmail.com) or Wei Fan (wei.fan@gmail.com). We will provide you with a secured download link.

[2006]   Kun Zhang, Wei Fan, Bill Buckles, Xiaojing Yuan, and Zujia Xu: "Discovering Unrevealed Properties of Probability Estimation Trees: On Algorithm Selection and Performance Explanation", The PowerPoint presentation by Dr. Kun Zhang is here. 2006 IEEE International Conference on Data Ming (ICDM'2006), December 2006.

[2006]   Ian Davidson, and Wei Fan, "When Efficient Model Averaging Out-performs Boosting and Bagging", PKDD 2006, Berlin, September 2006. Click here for its PowerPoint presentation.

[2006]   Wei Fan, Joe McClosky, and Philip S. Yu, "A General Framework for Accuracy and Fast Regression by Data Summarization in Random Decision Trees", KDD2006, Philadelphia, August 2006. Click here for PowerPoint presentation.

[2006]   Wei Fan, and Ian Davidson "ReverseTesting: An Efficient Framework to Select Amongst Classifiers under Sample Selection Bias", KDD2006, Philadelphia, August 2006. Click here for PowerPoint presentation.

[2005]   Wei Fan, Ed Greengrass, Joe McCloskey, and Philip S. Yu, "Effective Estimation of Posterior Probabilities: Explaining the Accuracy of Randomized Decision Tree Approaches". PowerPoint presentation. The Fifth IEEE International Conference on Data Mining (ICDM'05), Houston, Texas, November 2005.

[2005]   Wei Fan, Ian Davidson, Bianca Zadrozny, and Philip S. Yu "An Improved Categorization of Classifier's Sensitivity on Sample Selection Bias.", PowerPoint presentation, The Fifth IEEE International Conference on Data Mining (ICDM'05), Houston, Texas, November 2005. The long version including experimental results can be found here.

[2005]   Wei Fan, Janek Mathuria, and Chang-tien Lu "Making Data Mining Models Useful to Model Non-paying Customers of Exchange Carriers" (short paper), PowerPoint presentation. 2005 SIAM International Conference on Data Mining (SDM'05), Newport Beach, CA, April 2005.

[2005]   Fei Tony Liu, Kai Ming Ting, and Wei Fan Maximizing Tree Diversity by Building Complete-Random Decision Trees" (short paper), 2005 Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'05), Hanoi, Vietnam, May 2005.

[2004]   Wei Fan "On the Optimality of Probability Estimation by Random Decision Trees", PowerPoint presentation, The Nineteenth National Conference on Artificial Intelligence (AAAI'04), San Jose, CA, July, 2004.

[2004]   Wei Fan "Systematic Data Selection to Mine Concept-drifting Data Streams"", The PowerPoint presentation is here. The Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'05), Seattle, Washington, August 2004.

[2004]   Wei Fan, Yi-an Huang, Haixun Wang, and Philip S. Yu "Active Mining of Data Streams" (short paper), 2004 SIAM International Conference on Data Mining (SDM'04), Orlando, Florida, April 2004.

[2004]   Wei Fan, Yi-an Huang, and Philip S. Yu "Decision Tree Evolution with Limited Number of Examples from Drifting Data Streams" (short paper), PowerPoint is here. The Fourth IEEE International Conference on Data Mining (ICDM'04), Brighton, UK, November 2004.

[2004]   Wei Fan, Philip S. Yu, and Haixun Wang "Mining Extremely Skewed Trading Anomalies" (industrial track), The Ninth International Conference on Extending Database Technology (EDBT'04), Heraklion, Greece, March 2004.

[2004]   Wei Fan, "StreamMiner: A Classifier-Ensemble based Engine to Mine Concept-drifting Data Streams" (demo), The 2004 International Conference on Very Large Databases (VLDB'04), Toronto, Canada, August 2004.

[2004]   Haixun Wang, Fang Chu, Wei Fan, and Philip S. Yu "Fast Algorithm for Subspace Clustering by Pattern Similarity", 16th International Conference on Scientific and Statistical Database Management (SSDBM 2004), 21-23 June 2004, Santorini Island, Greece.

[2003]   Yi-an Huang, Wei Fan, Wenke Lee, Philip S. Yu "Cross-Feature Analysis for Detecting Ad-Hoc Routing Anomalies", click the expanded version. In 23rd International Conference on Distributed Computing Systems (ICDCS 2003), 19-22 May 2004, Providence, RI.

[2003]   Haixun Wang, Chang-Shing Perng, Wei Fan, Sanghyun Park, Philip S. Yu Indexing Weighted-Sequences in Large Databases'', 19th IEEE International Conference on Data Engineering (ICDE 2003), March 5-8, 2003, Bangalore, India: 63-74.

[2003]   Wei Fan, Haixun Wang, Philip S. Yu, Sheng Ma "Is random model better? On its accuracy and efficiency", PowerPoint presentation, 3rd IEEE International Conference on Data Mining (ICDM 2003), November 19-22, Melbourne, FL: 51-58.

The algorithm description and its source code is available here.

[2003]   Wei Fan, Haixun Wang, Philip S. Yu, Shaw-hwa Lo, "Inductive Learning in Less Than One Sequential Data Scan", PowerPoint presentation. 18th International Joint Conference on Artificial Intelligence (IJCAI 2003), August 15-19, 2003, Acapulco, Mexico: 595-600.

[2003]   Haixun Wang, Wei Fan, Philip S. Yu, Jiawei Han "Mining concept-drifting data streams using ensemble classifiers", 9th ACM International Conference on Knowledge Discovery and Data Mining (KDD 2003), August 24-27, 2003, Washington DC: 226-235.

[2003]   Haixun Wang, Sanghyun Park, Wei Fan, Philip S. Yu, "ViST: A Dynamic Index Method for Querying XML Data by Tree Structures" 2003 ACM SIGMOD International Conference on Management of Data (SIGMOD 2003): 110-121.

[2002]   Wei Fan, Fang Chu, Haixun Wang, Philip S. Yu, "Pruning and Dynamic Scheduling of Cost-Sensitive Ensembles", Here is the PowerPoint presentation, Eighteenth National Conference on Artificial Intelligence (AAAI 2002): 146-151.

[2002]   Haixun Wang, Chang-Shing Perng, Wei Fan, Philip S. Yu ``An Index Structure for Pattern Similarity Searching in DNA Microarray Data'' , 1st IEEE Bioinformatics Conference CSB 2002: 256-267.

[2002]   Wei Fan, Haixun Wang, Philip S. Yu, Salvatore J. Stolfo "A Fully Distributed Framework for Cost-Sensitive Data Mining" (short paper, long version provided). The PowerPoint presentation. 22nd International Conference on Distributed Computing Systems (ICDCS 2002), July 2-5, 2002, Vienna, Austria: 445-446.

[2002]   Wei Fan, Haixun Wang, Philip S. Yu, Shaw-hwa Lo, Salvatore J. Stolfo, Progressive Modeling. Here is the PowerPoint prsentation. 2nd IEEE International Conference on Data Mining (ICDM 2002), December 9-12, 2003, Maebashi, Japan: 163-170.

[2002]   Naoki Abe, Edwin P. D. Pednault, Haixun Wang, Bianca Zadrozny, Wei Fan, Chidanand Apte, "Empirical Comparison of Various Reinforcement Learning Strategies for Sequential Targeted Marketing", 2nd IEEE International Conference on Data Mining (ICDM 2002), December 9-12, 2002, Maebashi, Japan: 3-10.

[2002]   Edwin Pednault, Naoki Abe, Bianca Zadrozny, Haixun Wang, Wei Fan, and Chidanand Apte, "Sequential cost-sensitive decision making with reinforcement learning" , 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'02).

[2002]   Wei Fan, Haixun Wang, Philip S. Yu, Salvatore J. Stolfo, "A Framework for Scalable Cost-sensitive Learning Based on Combing Probabilities and Benefits", PowerPoint presentation. 2nd SIAM International Conference on Data Mining (SDM 2002), April 11-13, 2002, Arlington, VA.

[2002]   Wei Fan, Salvatore J. Stolfo "Ensemble-based Adaptive Intrusion Detection", PowerPoint is here. 2nd SIAM International Conference on Data Mining (SDM 2002), April 11-13, 2002, Arlington, VA.

[2001]   Wei Fan, Matthew Miller, Salvatore J. Stolfo, Wenke Lee, Philip K. Chan, "Using Artificial Anomalies to Detect Unknown and Known Network Intrusions", PowerPoint presention. First IEEE International Conference on Data Mining (ICDM 2001), 29 Nov- 1 Dec, 2001, San Jose, CA: 123-130.

[2001]   Wenke Lee, Salvatore J. Stolfo, Philip K. Chan, Eleazar Eskin, Wei Fan, Matthew Miller, Shlomo Hershkop, and Junxin Zhang "Real-time Data Mining-based Intrusion Detection". Second DARPA Information Survivability and Exposition 2001 (DISCEX'01), June 2001, pp. 89-100.

[2000]   Wei Fan, Wenke Lee, Salvatore Stolfo, Matthew Miller "A Multiple Model Cost-Sensitive Approach for Intrusion Detection". The Eleventh European Conference on Machine Learning (ECML00).

[2000]   William Cohen, Wei Fan "Web-Collaborative Filtering: Recommending Music by Spidering The Web" The Ninth International Conference on World Wide Web (WWW'99).

[2000]   Salvatore J. Stolfo, Wenke Lee, Wei Fan, Andreas Prodromidis, and Philip K. Chan "Cost-based Modeling for Fraud and Intrusion Detection: Results from the JAM Project" , DARPA Information Survivability Conference 2000.

[1999]   William Cohen, Wei Fan "Learning Page-independent Heuristics for Extracting Data from Web Pages", The Eighth International Conference on World Wide Web (WWW'00). Slightly extended journal version is here.

[1999]   Wei Fan, Salvatore J. Stolfo, Junxin Zhang and Philip Chan, "AdaCost: Misclassification Cost-sensitiveBoosting". Proceedings of the Sixteenth International Conference on Machine Learning (ICML'99), pp.97-105, Bled, Slovenia, June 1999.

[1999]   Wei Fan, Salvatore J. Stolfo and Junxin Zhang, "The Application of AdaBoost for Distributed, Scalable and On-line Learning" Proceedings of ACM Sixth International Conference on Knowledge Discovery and Data Mining (KDD'99). pp.362-366, San Diego, California, August 1999.

[1997]   Salvatore J. Stolfo, Andreas Prodromidis, Shelley Tselepis, Wenke Lee, Wei Fan, and Philip Chan, "JAM - Java Agents of Meta-learning over Distributed Networks" Proceeding of Third International Conference on Knowledge Discovery and Data Mining (KDD-97), pp.74 - 81, Newport Beach, California, August 1997. Runner-up best application paper . The JAM Project is a featured article in Trusted Information Systems Magazine, March, 1998

[1996]   Jose Moreira, Vijay Naik and Wei Fan ``Design and Implementation of Computational Steering for Parallel Scientific Applications'', Proceedings of the Eighth SIAM Conference on Parallel Processing for Scientific Computing, Minneapolis, Minnesota, March 14--17, 1997.

Journal and Magazine

[2011]    Wensheng Zhang, Andrea Edwards, Wei Fan, Prescott Deininger, and Kun Zhang "Alu Distribution and Mutation Types of Cancer Genes", BMC Genomics 2011, 12:157

[2010]    Wensheng Zhang, Andrea Edwards, Wei Fan, Dongxiao Zhu, and Kun Zhang "svdPPCS: an effective singular value decomposition-based method for conserved and divergent co-expression gene module identification", BMC Bioinformatics 11:338 (2010)

[2009]   Kun Zhang, Wei Fan, Prescott Deininger, Andrea Edwards, Zujia Xu, Dongxiao Zhu: "Breaking the Computational Barrier: a divide-conquer and aggregate based approach for ALU insertion site characterisation", International Journal of Computational Biolog and Drug Design, Vol 2, No 4, 2009

[2008]   Jing Gao, Bolin Ding, Wei Fan and Jiawei Han: "Classifying Data Streams with Skewed Distribution and Concept-drifts", IEEE Internet Computing. accepted to appear.

[2008]   Kun Zhang, and Wei Fan: "Forecasting skewed biased stochastic ozone days: analyses, solutions and beyond". Knowl. Inf. Syst. 14(3): 507-527 (2008)

[2004]   Wei Fan, Matthew Miller, Salvatore J. Stolfo, Wenke Lee, Philip K. Chan: "Using artificial anomalies to detect unknown and known network intrusions". Knowl. Inf. Syst. 6(5): 507-527 (2004)

[2002]   Wenke Lee, Wei Fan, Matthew Miller, Salvatore J. Stolfo, Erez Zadok "Toward Cost-Sensitive Modeling for Intrusion Detection and Response". Journal of Computer Security 10(1/2): 5-22 (2002)

[2001]   Wenke Lee, Wei Fan: Mining System Audit Data: Opportunities and Challenges. SIGMOD Record 30(4): 35-44 (2001)

[2001]   Salvatore J. Stolfo, Wenke Lee, Philip K. Chan, Wei Fan, Eleazar Eskin: "Data Mining-based Intrusion Detectors: An Overview of the Columbia IDS Project". SIGMOD Record 30(4): 5-14.

[2000]   William W. Cohen, Wei Fan: ``Web-collaborative filtering: recommending music by crawling the Web''. Computer Networks 33(1-6): 685-698 (2000)

[1999]   Philip Chan, Andreas Prodromidis, Wei Fan, and Salvatore J. Stolfo, "Distributed Data Mining in Credit Card Fraud Detection", IEEE Intelligent Systems Journal November/December 1999, pp.67-74.

[1999]   William Cohen and Wei Fan "Learning Page-independent Heuristics for Extracting Data from Web-pages" , International Journal of Computer and Telecommunication Networking, Vol.31, pp.1641-1652. (slightly extended version from WWW'99)

Workshops

[1999]   Wei Fan, Salvatore J. Stolfo, Philip K. Chan, "Using Conflicts Among Multiple Base Classifiers to Measure the Performance of Stacking", ICML-99 Workshop on Recent Advances in Meta-learning and Future Work, Bled, Slovenia, 1999.

[1997]   Salvatore J. Stolfo, Wei Fan, Wenke Lee, and Andreas Prodromidis Credit Card Fraud Detection using Meta-learning: Issues and Initial Results", AAAI'97 Workshop on Fraud Detection and Risk Management, Providence, Rhode Island, USA, 1997.

[1996]   Wei Fan, Philip K. Chan, and Salvatore J. Stolfo A Comparative Evaluation of Combiner and Stacked Generalization, AAAI'96 Workshop on Combining Multiple Models, Portland, Oregon, USA, 1996.

Technical Report

[2008]   Julia Stoyanovich, Kenneth A. Ross, Jun Rao, Wei Fan, Volker Markl, and Guy Lohman, ReoptSMART: A Learning Query Plan Cache, Columbia University Computer Science Technical Report cucs-023-08

[1999]   Salvatore J. Stolfo, Wei Fan, Wenke Lee, Andreas Prodromidis, "Cost-based Modeling and Evaluation for Data Mining with Application to Fraud and Intrusion Detection: Results from the JAM Project", Columbia University Computer Science 1999.

[1998]   Wei Fan, and Salvatore J. Stolfo, Recurisve Stacking to Improve the Accuracy of Combined Classifiers, Columbia University Computer Science 1998.

[1997]   Salvatore J. Stolfo, Wei Fan, Andreas Prodromidis, Wenke Lee, and Shelley Tselepis, Agent-based Fraud and Intrusion Detection Systems in Financial Information Systems, Columbia University Computer Science 1997.