A Survey On Deduplication File System For Virtual Machine Images
Shima V M, Dr. Jayasudha J S
Keywords: Virtual Machine, Deduplication, LiveDFS, Liquid.
ABSTRACT:Cloud computing is a dominant technology in IT industry for providing different services to users of cloud. One important service is infrastructure as a service. The virtualization is the technique used in infrastructure as a service in which it allocates the virtual machine to user through Internet. Virtual machine which is a guest machine that runs in the host machine environment. Virtual Machine Infrastructure is used to buy Virtual Machine instances to run virtual machines in the cloud platforms. The high overhead of maintaining a virtual machine has been achieved by hardware support such as Intel virtualization technology (VT) by improving the implementation of hypervisor such as KVM, etc. Storage of virtual machine remains a challenging problem because of the high demand of Virtual machine images. A Storage Area Network cannot satisfy the increasing demand of large scale virtual machine hosting for cloud computing because of its cost limitation. The provisioning and depository of a number of Virtual Machine Infrastructure is a challenging problem. The data deduplication is a method used in Virtual Machine Infrastructure and there are various data deduplication techniques which make Virtual Machine Infrastructure storage and provisioning simple and efficient. In this paper, different methods and processes are discussed which is used in deduplication of data to overcome the problems faced in virtual machine images.
 J.S. Bridle, “Probabilistic Interpretation of Feedforward Classification Network Outputs, with Relationships to Statistical Pattern Recognition,” Neurocomputing—Algorithms, Architectures and Applications, F. Fogelman-Soulie and J. Herault, eds., NATO ASI Series F68, Berlin: Springer-Verlag, pp. 227-236, 1989. (Book style with paper title and editor)
 W.-K. Chen, Linear Networks and Systems. Belmont, Calif.: Wadsworth, pp. 123-135, 1993. (Book style)
 H. Poor, “A Hypertext History of Multiuser Dimen-sions,” MUD History, http://www.ccs.neu.edu/home/pb/mud-history.html. 1986. (URL link *include year)
 K. Elissa, “An Overview of Decision Theory," unpublished. (Unplublished manuscript)
 R. Nicole, "The Last Word on Decision Theory," J. Computer Vision, submitted for publication. (Pending publication)
 C. J. Kaufman, Rocky Mountain Research Laboratories, Boulder, Colo., personal communication, 1992. (Personal communication)
 D.S. Coming and O.G. Staadt, "Velocity-Aligned Discrete Oriented Polytopes for Dynamic Collision Detection," IEEE Trans. Visualization and Computer Graphics, vol. 14, no. 1, pp. 1-12, Jan/Feb 2008, doi:10.1109/TVCG.2007.70405. (IEEE Transactions )
 S.P. Bingulac, “On the Compatibility of Adaptive Controllers,” Proc. Fourth Ann. Allerton Conf. Circuits and Systems Theory, pp. 8-16, 1994. (Conference proceedings)
 H. Goto, Y. Hasegawa, and M. Tanaka, “Efficient Scheduling Focusing on the Duality of MPL Repre-sentation,” Proc. IEEE Symp. Computational Intelli-gence in Scheduling (SCIS ’07), pp. 57-64, Apr. 2007, doi:10.1109/SCIS.2007.367670. (Conference proceedings)
 J. Williams, “Narrow-Band Analyzer,” PhD disserta-tion, Dept. of Electrical Eng., Harvard Univ., Cam-bridge, Mass., 1993. (Thesis or dissertation)
 E.E. Reber, R.L. Michell, and C.J. Carter, “Oxygen Absorption in the Earth’s Atmosphere,” Technical Report TR-0200 (420-46)-3, Aerospace Corp., Los Angeles, Calif., Nov. 1988. (Technical report with report number)
 L. Hubert and P. Arabie, “Comparing Partitions,” J. Classification, vol. 2, no. 4, pp. 193-218, Apr. 1985. (Journal or magazine citation)
 R.J. Vidmar, “On the Use of Atmospheric Plasmas as Electromagnetic Reflectors,” IEEE Trans. Plasma Science, vol. 21, no. 3, pp. 876-880, available at http://www.halcyon.com/pub/journals/21ps03-vidmar, Aug. 1992. (URL for Transaction, journal, or magzine)
 J.M.P. Martinez, R.B. Llavori, M.J.A. Cabo, and T.B. Pedersen, "Integrating Data Warehouses with Web Data: A Survey," IEEE Trans. Knowledge and Data Eng., preprint, 21 Dec. 2007, doi:10.1109/TKDE.2007.190746.(PrePrint)
 Offline Deduplication for Btrfs. http://www.spinics.net/lists/ linux-btrfs/msg07818.html.
 A.T. Clements, I. Ahmad, M. Vilayannur, and J. Li, ‘‘Decentralized Deduplication in San Cluster File Systems,’’ in Proc. Conf. USENIX Annu. Techn. Conf., 2009, p. 8, USENIX Association.