Æȷοì
Marc Najork
Marc Najork
Google DeepMind
google.comÀÇ À̸ÞÀÏ È®ÀÎµÊ - ȨÆäÀÌÁö
Á¦¸ñ
Àοë
Àοë
¿¬µµ
Detecting spam web pages through content analysis
A Ntoulas, M Najork, M Manasse, D Fetterly
Proceedings of the 15th international conference on World Wide Web, 83-92, 2006
8942006
Mercator: A scalable, extensible web crawler
A Heydon, M Najork
World Wide Web 2 (4), 219-229, 1999
8661999
A large-scale study of the evolution of web pages
D Fetterly, M Manasse, M Najork, J Wiener
Proceedings of the 12th international conference on World Wide Web, 669-678, 2003
8332003
Breadth-first crawling yields high-quality pages
M Najork, JL Wiener
Proceedings of the 10th international conference on World Wide Web, 114-118, 2001
6322001
Web crawling
C Olston, M Najork
Foundations and Trends® in Information Retrieval 4 (3), 175-246, 2010
6122010
Spam, damn spam, and statistics: Using statistical analysis to locate spam web pages
D Fetterly, M Manasse, M Najork
Proceedings of the 7th International Workshop on the Web and Databases ¡¦, 2004
4732004
On near-uniform URL sampling
MR Henzinger, A Heydon, M Mitzenmacher, M Najork
Computer Networks 33 (1-6), 295-308, 2000
3432000
Boxwood: Abstractions as the Foundation for Storage Infrastructure.
J MacCormick, N Murphy, M Najork, CA Thekkath, L Zhou
OSDI 4, 8-8, 2004
2782004
Position Bias Estimation for Unbiased Learning to Rank in Personal Search
X Wang, N Golbandi, M Bendersky, D Metzler, M Najork
11th ACM International Conference on Web Search and Data Mining, 2018
2762018
Learning to rank with selection bias in personal search
X Wang, M Bendersky, D Metzler, M Najork
39th International ACM SIGIR Conference on Research and Development in ¡¦, 2016
2712016
Automatically Creating Training Data For Language Identifiers
M Goldszmit, M Najork, S Paparizos
US Patent App. 13/943,788, 2015
2302015
On the evolution of clusters of near-duplicate web pages
D Fetterly, M Manasse, M Najork
Proceeding of the 1st Latin American Web Congress, 37-45, 2003
2232003
High-performance web crawling
M Najork, A Heydon
Handbook of massive data sets, 25-45, 2002
2162002
WIT: Wikipedia-based image text dataset for multimodal multilingual machine learning
K Srinivasan, K Raman, J Chen, M Bendersky, M Najork
44th International ACM SIGIR Conference on Research and Development in ¡¦, 2021
2122021
SOCIAL NETWORK RECOMMENDED CONTENT AND RECOMMENDING MEMBERS FOR PERSONALIZED SEARCH RESULTS
T Harrington, R Shenoy, M Najork, R Panigrahy
US Patent App. 13/252,215, 2013
2102013
Measuring index quality using random walks on the Web
MR Henzinger, A Heydon, M Mitzenmacher, M Najork
Computer Networks 31 (11-16), 1291-1303, 1999
2081999
Detecting phrase-level duplication on the world wide web
D Fetterly, M Manasse, M Najork
Proceedings of the 28th annual international ACM SIGIR conference on ¡¦, 2005
1902005
System and method for associating an extensible set of data with documents downloaded by a web crawler
MA Najork, CA Heydon
US Patent 6,351,755, 2002
1832002
A sketch-based distance oracle for web-scale graphs
A Das Sarma, S Gollapudi, M Najork, R Panigrahy
Proceedings of the third ACM international conference on Web search and data ¡¦, 2010
1702010
Web crawler system using plurality of parallel priority level queues having distinct associated download priority levels for prioritizing document downloading and maintaining ¡¦
MA Najork, CA Heydon, JL Wiener
US Patent 6,263,364, 2001
1642001
ÇöÀç ½Ã½ºÅÛÀÌ ÀÛµ¿µÇÁö ¾Ê½À´Ï´Ù. ³ªÁß¿¡ ´Ù½Ã ½ÃµµÇØ ÁÖ¼¼¿ä.
ÇмúÀÚ·á 1–20