Optimal Document Archiving and Fast Information Retrieval

Authors: Hazem M. El-Bakry, Ahmed A. Mohammed

Abstract:

In this paper, an intelligent algorithm for optimal document archiving is presented. Electronic archives are known to be very important for information system management, and minimizing the size of the data stored in an electronic archive is a key issue in reducing the physical storage required. Here, the effect of different types of Arabic fonts on electronic archive size is discussed. Simulation results show that PDF is the best file format for storing Arabic documents in an electronic archive. Furthermore, a method for fast information detection in a given PDF file is introduced. This approach uses fast neural networks (FNNs) implemented in the frequency domain. These networks operate by performing cross correlation in the frequency domain rather than in the spatial domain. It is proved mathematically and practically that the presented FNNs require fewer computation steps than conventional neural networks (CNNs). Simulation results using MATLAB confirm the theoretical computations.
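
The frequency-domain idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, which is reported to be in MATLAB. It only demonstrates the general identity that cross correlation of a signal with a weight window equals the inverse transform of F(signal) multiplied by the complex conjugate of F(weights). The function names (`fft`, `ifft`, `xcorr_direct`, `xcorr_fft`) and the one-dimensional, real-valued setting are assumptions chosen for illustration.

```python
import cmath


def fft(x, inverse=False):
    """Recursive radix-2 Cooley-Tukey FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return list(x)
    even = fft(x[0::2], inverse)
    odd = fft(x[1::2], inverse)
    sign = 1 if inverse else -1
    out = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + n // 2] = even[k] - twiddle
    return out


def ifft(x):
    """Inverse FFT: unnormalized inverse transform scaled by 1/len(x)."""
    n = len(x)
    return [v / n for v in fft(x, inverse=True)]


def xcorr_direct(signal, weights):
    """Slide the weight window over the signal (spatial-domain approach).

    Costs on the order of len(signal) * len(weights) multiply-adds.
    """
    n = len(weights)
    return [sum(signal[i + j] * weights[j] for j in range(n))
            for i in range(len(signal) - n + 1)]


def xcorr_fft(signal, weights):
    """Same result computed in the frequency domain.

    Assumes real-valued inputs and len(weights) <= len(signal).
    Both inputs are zero-padded to a common power-of-two length.
    """
    size = 1
    while size < len(signal):
        size *= 2
    s = list(signal) + [0.0] * (size - len(signal))
    w = list(weights) + [0.0] * (size - len(weights))
    S, W = fft(s), fft(w)
    # Pointwise product with the conjugate gives circular cross correlation.
    product = [a * b.conjugate() for a, b in zip(S, W)]
    full = ifft(product)
    # Keep only positions where the window lies fully inside the signal,
    # so no wrap-around terms contribute.
    valid = len(signal) - len(weights) + 1
    return [full[i].real for i in range(valid)]


if __name__ == "__main__":
    sig = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0, 10.0]
    win = [1.0, 0.0, -1.0]
    direct = xcorr_direct(sig, win)
    fast = xcorr_fft(sig, win)
    print(all(abs(a - b) < 1e-9 for a, b in zip(direct, fast)))
```

The two functions agree up to floating-point rounding. The direct version does work proportional to the signal length times the window length, while the transform route is dominated by FFTs whose cost grows as N log N in the padded length N, which is the general reason a frequency-domain formulation can need fewer computation steps for large inputs.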

Keywords: Information Storage and Retrieval, Electronic Archiving, Fast Information Detection, Cross Correlation, Frequency Domain.

Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1083121

