Digital Web Archive Collection and Storage Model Based on Soft Computing and Edge-Driven System


With the continuous development of archival information networks, the network has become an important means of obtaining archival information resources. At the same time, the number of users of network archives is increasing rapidly, so research on these users and sound management of them have become urgent tasks. As the main body of archives utilization, archives users play a leading role in the content and direction of archives management and utilization. The informatization of archives management is not equivalent to computerization or networking; it is a systematic project related to the whole educational reform and modernization. Because modern archives management has become a complex system, it can be divided at multiple levels. The first level is divided into two subsystems, archives entity management and archives information development, and each subsystem is further divided into several levels of smaller systems. Archives entity management is divided into collection, sorting, identification, storage, statistics and other work links. The whole file management system and its subsystems form a feedback mechanism in operation. With the rapid development of computer and network technology, multimedia information systems are penetrating into people's lives. A multimedia information system mainly includes three parts: a central management system, a terminal display system and a network. Based on a multimedia information management system, this paper constructs a digital network archives collection and storage model.


Introduction
With the strengthening of social archives security protection, people are paying more and more attention to archives. As the main body of archives utilization, archives users play a leading role in the content and direction of archives management and utilization. Research on archives users helps archives institutions better understand users' needs and provide efficient, high-quality archives information services. With the rapid development of the Internet, networks are becoming more and more important to the public. At the same time, government departments, companies, scientific research institutions and others have built various forms of network platforms. With the help of proprietary or public network platforms, more and more file information is provided and archives resources can be consulted [1][2][3][4]. With the emergence of network technology, the information demands of archives users and the methods of utilizing and managing archives resources have changed. Research on archives users in the network environment has attracted the attention of archival circles. Some archival scholars and practitioners have begun to explore the influence of the network environment on the process and subject of archives utilization, and research on archives users has become a hot spot again [5][6][7][8].
For a long time, the passivity of archives organizations has seriously hindered them from meeting users' requirements for efficient, high-quality information services in the network environment. The traditional way of providing information services is too simple to take into account the needs of the overwhelming majority of people. The Internet, by contrast, provides an open, real-time information retrieval platform, and more and more archives information users can obtain the file information they need without limits of time and space. Therefore, against the background of national archives informatization, it is necessary to conduct comprehensive research on archives information service in the network environment and to understand its research status and development trend [9][10][11][12]. The ultimate purpose of file management is to provide file information for social practice, and the structure of the file management system is set up according to this purpose. Each part of the system is essential and follows a certain procedure. Together the parts form an organic whole: each plays its own role in realizing the overall function of the file management system, while all are interrelated and mutually restricted [13][14][15][16].
The significance of a network document archives management system is as follows:
(1) Strengthen the whole-process management requirements of archives. Archives management business covers the traditional archives plan management business, management conditions and innovation environment construction, as well as approval and audit business. To manage all of these businesses better, they need to be handled through one set of systems, so an integrated management system must be built.
(2) Strengthen the cooperation ability of archives management personnel. The archives plan involves archives management departments at different levels, such as the document and archives management department; many different positions in the different business departments of that department; the decision-making, management and executive levels; and a large number of experts, intermediary institutions and archives units. A management system can effectively improve the efficiency of their collaborative work.
(3) Improve the interaction between archives office and enterprises and institutions. Enterprises are the main archivists of science and technology plan, experts play an important role in the process of science and technology plan management, and recommendation units and intermediary organizations are important participants in science and technology management. However, there is no effective means to interact with them seamlessly. Through the construction of this management system, online communication and directional notice can be sent to specific groups. These means can effectively strengthen the interaction between government and enterprises.
(4) Realize the sharing of archives resources. Due to the division of work, different archives resources are managed by different people. There are a lot of important files and information management systems in it. Through an effective authorization mechanism, people with authority can view all relevant information.
(5) Establish the basis of daily management of archives. Through the construction of management system, more and more expert databases with higher quality can be established. It can establish a standard project database which is convenient for decision analysis in the future. The whole process of planning, archives, acceptance, grouping, review, approval, publicity, project establishment, tracking and supervision, and final acceptance are completed through the management system. The system will automatically remind the relevant personnel of the work to be done at the critical moment, which can be convenient for directional release of announcements. The system can flexibly select the relevant list and send e-mail, fax and SMS in groups.
(6) Strengthen project supervision. Due to the long project cycle, the large number of projects to be supervised, and the different emphasis to be supervised in different stages, the project supervision work is difficult to achieve. The management system can set the supervision time and automatically remind the relevant personnel for inspection. Automatically count and screen the list of enterprises that do not meet the regulatory requirements. This provides a powerful weapon for the project supervision of archives management personnel.
Multimedia technology is a computer technology that developed in the 1980s, and its development was a process of gradual accumulation. In the 1950s, computers were used for numerical calculation, with digital media as the processing object. In the 1960s, computing was extended to data processing: the combination of computers with character and text processing technology took digital and character media as processing objects. Then the combination of computers, graphics and mathematics led to computer-aided design technology, which mainly deals with graphic media. The combination of computers with image processing technology produced computer image processing technology, which deals with image media, and the combination of computers with speech processing technology produced computer speech processing technology, which mainly deals with voice media. The multimedia technology developed in the 1980s is not only a comprehensive integration of the above technologies, but also a close combination of computer and image processing technology. The result is a technology that takes the computer as its tool and treats media such as pictures, text, sound, and dynamic and static images as a combined processing object. The emergence of multimedia technology added new technical content to computing and provided a new application environment for users [17][18][19][20].
Nowadays, multimedia information publishing systems provide accurate, real-time information to the public. Government departments, schools, enterprises and even military forces widely use multimedia information release systems [21][22][23][24]. Releasing information through multimedia has greatly enriched the channels and presentation of information, increased its comprehensiveness and timeliness, and achieved good social benefits.

Literature Review
Research on archives users in China can be traced back to the theory of archives utilization in the 1960s, whose development involved the problem of archives users. At the end of the 1980s, archival user research gradually developed in China, but attention to research on user needs was still developing, and there was no theoretical system like that of the shape image information user research. At the end of the 20th century, as computer technology was integrated into the field of archives service, the trend toward digitalization of archives information resources became obvious. This had an impact on the research of archives users, and research on network archives users began to attract attention. Guo Xiaoyun thinks that the paid archives information service implemented by archives institutions in the network environment is the extension and development of the traditional paid archives service under the market economy, which is normal and appropriate. She also proposed that the fees charged for online archives information services include the fees for traditional paid archives services as well as new fees for providing services in the network environment, such as the digital fees generated through the network. Jin Fan believes that the formation of archives, the nature of archival work and the law by which archival value is realized determine and affect, to a certain extent, the research and development of archival information service in the network environment. In the network environment, archives information service is becoming more and more socialized, and the most typical sign is the industrialization of the whole information service. The industrialization of network archives information service will also be an important part of information service industrialization.
Archives information service is an important part of national public service, which should play an active and important role in scientific research and social production practice.
In recent years, driven by the strong demand for office informatization, Chinese government departments and large and medium-sized enterprises have rapidly promoted their own information construction. VCD was first designed in China and has been widely used. The Institute of Acoustics of the Chinese Academy of Sciences has made outstanding contributions to the design and industrialization of MPEG-1 decoding equipment. The MPEG-2 decoding equipment jointly completed by Harbin Institute of Technology and the National Intelligent Computer Research and Development Center of the Institute of Computing, Chinese Academy of Sciences, has been successfully applied to DVD and digital satellite TV receivers and has entered the industrialization stage [25][26][27][28]. In the mid-1980s, the concept of multimedia was introduced into China, and some methods of multimedia technology were gradually realized. In the 1990s, multimedia technology began to attract attention and development. The second stage is the standardization stage. In this century, multimedia information publishing technology has developed rapidly because of its good visual interface. During this stage, large domestic enterprises and institutions and many government departments built multimedia information publishing systems based on the client/server model. The advantage of the client/server model is that information resources are saved on the server, which makes multimedia information flexible to use. Application systems based on the client/server model also have shortcomings, such as strong dependence on the operating system and poor scalability, so they cannot meet the growing needs of users. This led to the third stage, the vigorous development of multimedia publishing technology, in which multimedia information publishing systems based on a network platform emerged. Such a system can manage and broadcast multimedia information in real time.

Demand Analysis of Multimedia Information System
According to the software requirements specification of the system, the functions of the multimedia information publishing system are as follows: users can log in and operate on the web page. The multimedia system manages and sets up various types of multimedia materials, and sends the information to the multimedia terminal for publishing or playing. Multimedia materials include electronic documents, pictures, recording, video, flash, etc. In addition, the system has the function of managing these multimedia terminals.
The system consists of three parts:
Server: it provides collaborative management, data storage and distribution functions of all terminals in the whole network, and provides address mapping between management terminal and playback terminal.
Management terminal: provides basic management and program production of data such as material, program, broadcast table, broadcast terminal and user, as well as remote control of broadcast terminal.
Broadcast terminal: it provides the automatic play of programs and play lists and the emergency broadcast of inserted contents, and accepts the remote control of the management terminal. At the same time, it sends heartbeat data to the server regularly to maintain online status.
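The heartbeat mechanism described above can be illustrated with a minimal Python sketch. It is not the system's actual implementation: the class name HeartbeatRegistry, the 30-second timeout and the injectable clock are all assumptions introduced here for illustration, since the paper does not specify how the server tracks online status.

```python
import time


class HeartbeatRegistry:
    """Hypothetical server-side helper that tracks terminal heartbeats."""

    def __init__(self, timeout_seconds=30.0, clock=time.monotonic):
        # timeout_seconds is an assumed value; the paper does not give one.
        self.timeout_seconds = timeout_seconds
        self._clock = clock
        self._last_seen = {}  # terminal id -> time of the last heartbeat

    def record_heartbeat(self, terminal_id):
        """Called whenever a broadcast terminal sends heartbeat data."""
        self._last_seen[terminal_id] = self._clock()

    def is_online(self, terminal_id):
        """A terminal counts as online if its last heartbeat is recent enough."""
        last = self._last_seen.get(terminal_id)
        if last is None:
            return False
        return (self._clock() - last) <= self.timeout_seconds

    def online_terminals(self):
        """Return the ids of all terminals currently considered online."""
        return sorted(t for t in self._last_seen if self.is_online(t))
```

Passing the clock in as a parameter keeps the sketch testable without real waiting; a deployed server would more likely combine such a registry with a periodic sweep that marks silent terminals offline.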
The components of the multimedia information system are shown in Figure 1.
(1) Authority management. The user login interface is provided on the home page of the system, and the user can use the system after logging in. User permissions are divided into ordinary users and administrator users. The system is not open to public registration; only an administrator user has the authority to register new users. Administrators can also delete ordinary users, and the system determines each user's rights to resources.
(2) Resource management. Users can upload local multimedia resources to the server through the resource management function and can query the resources stored on the server. They can also download multimedia resources from the server to the local machine or delete resources on the server.
(3) Programming. Users can edit the multimedia resources on the server to produce the broadcast program through the program arrangement function.
(4) Broadcast table editing. Users can edit the prepared programs through the playlist editing function, and combine one or more programs into a playlist by adding playing properties to the programs.
(5) Terminal management. Terminal refers to the playing terminal. Users can control and manage the playing terminal through terminal management.
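As a rough illustration of functions (1) and (4) above, the following Python sketch models administrator-only registration and deletion, and the combination of programs with playing properties into a playlist. All class and field names (UserDirectory, PlaylistEntry, start, repeat) are hypothetical; the paper does not list the actual playing properties or data structures.

```python
from dataclasses import dataclass, field


@dataclass
class User:
    name: str
    is_admin: bool = False


class UserDirectory:
    """Hypothetical in-memory user store; registration is not public."""

    def __init__(self, first_admin):
        self._users = {first_admin: User(first_admin, is_admin=True)}

    def _require_admin(self, actor):
        user = self._users.get(actor)
        if user is None or not user.is_admin:
            raise PermissionError(f"{actor!r} is not an administrator")

    def register(self, actor, new_name):
        """Only an administrator may register a new ordinary user."""
        self._require_admin(actor)
        if new_name in self._users:
            raise ValueError(f"user {new_name!r} already exists")
        self._users[new_name] = User(new_name)

    def delete(self, actor, name):
        """Only an administrator may delete, and only ordinary users."""
        self._require_admin(actor)
        if self._users[name].is_admin:
            raise PermissionError("only ordinary users can be deleted")
        del self._users[name]

    def exists(self, name):
        return name in self._users


@dataclass
class PlaylistEntry:
    program: str
    start: str       # assumed playing property, e.g. "08:00"
    repeat: int = 1  # assumed playing property


@dataclass
class Playlist:
    name: str
    entries: list = field(default_factory=list)

    def add(self, program, start, repeat=1):
        """Attach playing properties to a program and append it."""
        self.entries.append(PlaylistEntry(program, start, repeat))
        return self
```

For example, `Playlist("morning").add("news", "08:00").add("weather", "08:30", repeat=2)` combines two programs into one playlist.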
For the current system architecture in the software development field, there are mainly two software architectures to choose from: the first one is client/server mode (C/S); The second is the Web-based distributed object computing model, known as the B/S model.
In C/S mode, the application is distributed across the client and the server. In C/S architecture, the server mainly provides data management, data sharing, data and system maintenance, concurrency control and other functions, while the client program mainly completes the specific business of the user. The user issues instructions to the server through the client, and the server extracts the corresponding data according to the instructions received and sends them to the client. The client then performs maintenance operations such as adding, modifying and deleting on the received data, or sends the results back to the server. When there is a large amount of data interaction between the two parties, network congestion is likely. C/S systems therefore usually reduce network traffic by processing data in the back-end database and sending only the processing results to the client, which avoids sending large amounts of data to the client for processing [29][30][31][32].
C/S architecture mainly has the following characteristics:
Strong interactivity. In C/S architecture, the client is a complete application with powerful functions for data processing, application interaction and other aspects. The development of the user interface is relatively simple and fast. There is no need to define interface control styles or adopt layout technology to control the interface, so a clear layout of system pages can be easily realized.
A more secure access mode. Because C/S architecture is a point-to-point structure in which the server and the client are paired together, and it adopts a network protocol with good security, it has high security.
Fast execution speed. Because the business logic of the C/S architecture is processed on the client side and only the results are submitted to the server, the client responds quickly, which effectively improves the execution speed of the system.
In B/S architecture, the user's working interface is the web page of the browser; a small amount of data logic is processed in the front end, and the main data logic is processed on the server side. B/S architecture is a network architecture pattern that emerged after the rise of the Internet, and the most important application software on the client is the Web browser. This pattern makes the client more unified and centralizes the core functions of the system on the server, which makes the system easier to develop, maintain, and use. B/S architecture mainly has the following characteristics:
Compatibility with various operating systems and software platforms. The client can be realized through a common browser, which saves a lot of hard disk space and memory on the client, and the installation process is very simple.
Simple and convenient system maintenance, expansion and upgrading. Since most B/S clients are browsers, there is little need to do any maintenance on them; generally only the server needs to be maintained. When the system is upgraded, only the server side is upgraded. It is relatively easy to upgrade and maintain the system, which can save a lot of manpower, material resources, time and cost.
The function design of multimedia information release system adopts the scheme of module subdivision. In the overall function, the system is designed as database module, management terminal module, playing terminal module. Management terminal module includes login management module, user authority module, resource management module, programming module, playlist editing module, client management module, etc. The playing terminal module includes the playing interface module, interactive module, resource download module and so on.

ASP.NET based System Development
ASP.NET provides a component-based approach to designing, developing, assembling, and deploying enterprise applications. Its goal is to provide a platform-independent, multi-user, secure enterprise-level development platform for B/S-based servers. Figure 2 shows the three-tier architecture of ASP.NET technology.
Figure 2. ASP.NET technical system
For a software system, database technology is the advanced stage of data management. Database technology uses the idea of data storage to organize and standardize the data model, and stores the canonical data structure in memory to form a regular data set. A database provides application-level services to a specific organizational structure in an optimized manner. The purpose of a data warehouse is to support decision making through the analysis of historical and subject-oriented information in the database. Traditional business systems make it difficult to get information about the state of the business, and the data warehouse is a part of enterprise decision support.
Before a software engineering project is designed and developed, it is necessary to establish a model. The UML modeling tool used in this article is Rational Rose, and its working interface is shown in Figure 3. Rational Rose simplifies, extracts, and refines the architectural and functional modeling of software systems to reduce unnecessary design costs. Building the UML model with this tool can effectively meet the designer's design and extension requirements throughout the software development cycle.
Asynchronous JavaScript and XML (AJAX) is a Web development technique for creating interactive web applications. In AJAX, XMLHttpRequest is the core technology that lets JavaScript on a page communicate with the server. The JavaScript within the page can interact with the server without refreshing the page; that is, the page can either fetch data from the server or submit data to the server. Compared to traditional Web technologies, AJAX can maintain data without updating the entire page. This allows Web applications to respond to user actions more quickly and avoids resending information over the network that has not changed.
ASP.NET AJAX includes a server-side part and a client-side part. The server-side part of ASP.NET AJAX provides five server controls: ScriptManager control: manages the scripts of the page.

Construction of Multimedia Information Technology Management System
The information in multimedia information resources usually comes from two sources: one is the part that is not borrowed, which is usually added by the users themselves; the other is information collected over the Internet or provided by others. AJAX is used extensively in the process of uploading multimedia data. When a file is selected, the system asynchronously retrieves the properties of the file and other related properties. This allows the background to collect accurate statistics on some search information without requiring user participation. This approach improves the accuracy of the information, the efficiency of adding information and the user experience.

Design of Network Archive Database
The database should have a logical structure that describes the data name, type, storage location, length, and other related parameters. The design of database should support the basic application and function of multimedia information release system. The rationality and security of database design play a vital role in the system. The database must meet the following requirements:
Real-time: Fast access to data to meet the requirements of real-time functions.
Flexibility: Multiple ways of accessing databases should be provided, requiring at least one to be intuitive and easy to access.
Recoverability: the database system is required to quickly recover to the state before the failure after the system failure disappears.
Integrity: In all cases, the integrity of the data in the database should be guaranteed.
Multimedia information system is a relational database application system. All information processed by the system is stored in the database. In the development of database application system, the structure design of database is a very important problem. The multimedia information system database is implemented by SQL Server database, which is mainly composed of user data, program data, broadcast table data and resource data. In order to avoid reading data that does not exist, the data in all tables is not allowed to be null.
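The no-null rule can be sketched as a schema. The paper's system uses SQL Server; the example below substitutes Python's built-in sqlite3 module purely so that it is self-contained, and every table and column name is an assumption, since the paper names only the four groups of data (user, program, broadcast table and resource).

```python
import sqlite3

# Assumed schema: every column is declared NOT NULL, mirroring the rule that
# no table may contain null data. Names are illustrative only.
SCHEMA = """
CREATE TABLE users (
    user_id     INTEGER PRIMARY KEY,
    name        TEXT    NOT NULL UNIQUE,
    is_admin    INTEGER NOT NULL DEFAULT 0
);
CREATE TABLE resources (
    resource_id INTEGER PRIMARY KEY,
    title       TEXT    NOT NULL,
    media_type  TEXT    NOT NULL,
    path        TEXT    NOT NULL
);
CREATE TABLE programs (
    program_id  INTEGER PRIMARY KEY,
    title       TEXT    NOT NULL,
    resource_id INTEGER NOT NULL REFERENCES resources(resource_id)
);
CREATE TABLE playlist_items (
    item_id     INTEGER PRIMARY KEY,
    playlist    TEXT    NOT NULL,
    program_id  INTEGER NOT NULL REFERENCES programs(program_id),
    start_time  TEXT    NOT NULL
);
"""


def open_database(path=":memory:"):
    """Create the four tables in a new SQLite database and return the connection."""
    conn = sqlite3.connect(path)
    conn.executescript(SCHEMA)
    return conn
```

With this schema, an insert that omits or nulls a required column is rejected by the database (sqlite3 raises IntegrityError), so application code never reads a missing value.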
In an efficient multimedia storage server, the issues associated with a storage manager include:
(1) A variety of magnetic and optical storage devices are used to meet the needs of different multimedia applications. These storage devices can have different properties, and they will be modeled uniformly and integrated into a single multimedia storage manager.
(2) Multimedia data requires a large storage space. In some application environments, the third and multiple storage layers must be used. In terms of multimedia application requirements, it is very important to make unified and effective integration for multi-level storage.
(3) The real-time retrieval performance of some multimedia objects requires quite a lot of running time. It is necessary to develop appropriate scheduling and resource allocation algorithms to achieve good performance in environments where delay sensitive data and non-delay sensitive data are mixed.
(4) Multimedia data is generally characterized by large volume and real-time retrieval. Therefore, it is necessary to effectively use the parallel mechanism of storage and processing in such systems.
(5) Many important multimedia information system applications are actually distributed. In order to ensure the system has a good performance of multimedia data retrieval, it is necessary to have the support of distributed system.
In a database environment, a combination of collection-oriented lookup and cruise-oriented browsing techniques is commonly used. Hypertext- and hypermedia-oriented systems use linked lists to cruise the database to find relevant concepts. Techniques such as keyword lists and relevance feedback are also used to create user profiles for effective query definitions and refinement. Typically, information is received through a variety of information sources and stored in an internal representation that facilitates addressable access, efficient display, and information extraction of content. The purpose of information extraction and display is to extract the associated information from the repository and present it to the user in the most appropriate form. This processing often involves the transformation of information.
Distributed hierarchical storage management functions include migration and retrieval tasks:
1. Migration is the process of moving data from a client to a migration library, which is a remote mass storage device.
2. Retrieval is the movement of data from a remote storage device back to the original location of the client.
Distributed hierarchical storage management moves data between migrating clients and migrating storage servers. This migration is managed by a distributed hierarchical storage manager. A migration client is any computer on the network that contains data to be migrated. A migration server is a computer that provides migration services on the network. Migrating storage areas on the migration server can be disk, tape, or CD-ROM storage media.
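The migration and retrieval tasks can be sketched as follows. This is a minimal in-memory Python model under stated assumptions, not the system's implementation: two dictionaries stand in for the migration client and the remote migration library, and the idle-time policy (migrate_idle with a max_idle threshold) is an assumption added for illustration, since the paper does not state when data is migrated.

```python
class HierarchicalStore:
    """Hypothetical model of a migration client and a migration library."""

    def __init__(self):
        self.client = {}       # name -> data held on the migration client
        self.library = {}      # name -> data held in the migration library
        self.last_access = {}  # name -> time of the most recent access

    def put(self, name, data, now):
        """Store new data on the client."""
        self.client[name] = data
        self.last_access[name] = now

    def migrate(self, name):
        """Migration: move data from the client to the migration library."""
        self.library[name] = self.client.pop(name)

    def retrieve(self, name, now):
        """Retrieval: move data back to its original location on the client."""
        if name in self.library:
            self.client[name] = self.library.pop(name)
        data = self.client[name]  # raises KeyError for unknown names
        self.last_access[name] = now
        return data

    def migrate_idle(self, now, max_idle):
        """Assumed policy: migrate everything not accessed for max_idle time units."""
        idle = [n for n in self.client if now - self.last_access[n] > max_idle]
        for n in idle:
            self.migrate(n)
        return sorted(idle)
```

In a real deployment the library would be a remote disk, tape or CD-ROM store reached over the network; the dictionaries only make the data movement visible.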

Design of Multimedia Information System Storage Management
Multimedia data requires a lot of storage capacity: there are already multiple disks for archiving multimedia data. They are inexpensive, have large storage capacity, and have fast random access speed. Several other performance optimizations must be made to achieve the performance levels required for future VLBS. These optimizations include leveraging third-level storage, parallel mechanisms, hypercache, migration algorithms, scheduling, and query optimization, among others.
Multimedia storage managers should support scalability and storage transparency. In cases where multimedia data can take up a large amount of storage space, the system should be able to provide a variety of operations using multimedia data. In a multimedia information retrieval environment, a large number of objects may satisfy a query, but users do not use all of them, because all they need is a subset of the data. In this case, it is more economical not to transfer these large objects to main memory until the query request is very precise.
Because of the data transfer time, the retrieval of large objects may be delayed. The system should try to reduce this delay by using the parallel mechanism of secondary storage. Different multimedia data types have different feature requirements, so it is necessary to use appropriate data layout algorithms for different data types.
The storage manager should be able to efficiently and transparently handle multiple storage layers. Main storage is the first storage layer, which is fast and growing in capacity and can be used to hold certain active parts of the database. A typical secondary storage layer consists of disks and optical disks, also known as secondary storage, which are used to store multimedia data with relatively high access probabilities. The third level of storage is used to store data that is not frequently accessed. Each disc has a number of read-write devices associated with it, and in larger configurations there may be more than one disc. The third storage layer has great capacity and can meet the space requirement of multimedia database application, but it is very slow.
With the development of multimedia technology, the speed of auxiliary storage devices and the transmission speed of communication channels have been greatly improved. Fiber optic transmission rates can reach hundreds of megabits per second. However, past practice shows that users will soon consume all the additional resources they can get. Therefore, it is necessary to research and develop algorithms that ensure good real-time transmission and synchronization performance for multimedia information. These algorithms should use the super cache memory on the network and a multimedia data model to ensure data transmission and synchronization.
Under the existing conditions, the storage management of multimedia information system can be designed based on a loosely coupled architecture. In other words, database management system is used to manage index information and file logical address, while file manager is used to manage multimedia files. An integration module is responsible for integrating DBMS and multimedia file manager. In this way, we can use the existing multimedia file management technology and DBMS technology to manage the storage of multimedia information system.
In the retrieval process, the query interface provided by the storage management system first receives the user's retrieval request and the related information of the query target. Then, the retrieval module of storage management system queries the index information base according to the index information provided by the user, and displays all index records in the index information base that meet the query target information provided by the user. At this time, the multimedia object itself is not transmitted to the local. When the user selects the multimedia data object according to the index record information displayed by the system, the retrieval module of the system communicates with the index information base again to obtain the logical address of the multimedia object. The system automatically finds the retrieved multimedia object from the multimedia storage server according to the logical address, and transmits the object back to the client for browsing.
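The two-step retrieval described above can be sketched in Python. The sketch assumes a simple keyword index and a dictionary standing in for the multimedia storage server; the names IndexRecord, RetrievalModule and the "store://" address format are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class IndexRecord:
    object_id: int
    title: str
    keywords: tuple       # lowercase keywords describing the object
    logical_address: str  # where the object lives on the storage server


class RetrievalModule:
    """Hypothetical retrieval module over an index base and a file store."""

    def __init__(self, index_records, file_store):
        self._index = {r.object_id: r for r in index_records}
        self._file_store = file_store  # logical address -> object bytes
        self.transfers = 0             # counts objects actually transmitted

    def search(self, keyword):
        """Step 1: query the index base only; no multimedia object is moved."""
        kw = keyword.lower()
        return [(r.object_id, r.title)
                for r in self._index.values() if kw in r.keywords]

    def fetch(self, object_id):
        """Step 2: resolve the logical address, then transmit the object."""
        address = self._index[object_id].logical_address
        self.transfers += 1
        return self._file_store[address]
```

Separating search from fetch reflects the loosely coupled design: the index side answers queries cheaply, and a large object crosses the network only after the user has selected it.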

Experiment and Verifications
This paper constructs a network file acquisition and storage model based on multimedia resource management. In the multimedia resource management module, it mainly displays all the resource information and provides retrieval approaches. The file list page and the query page are shown in Figure 4. From Table 1, it can be seen that the response time increases with the number of users or terminals, but the time stays within the range acceptable to users, which ensures the real-time performance of multimedia information release.
After the user has configured the basic user configuration and data configuration information in the login interface, click the "confirm download" button to download the data. If a user authentication error occurs, the server returns an error message. If the user is authenticated, the server will return the user's authorized data information to the client. The interface for server-side users to download files is shown in Figure 5.

Conclusion and Prospect
Archival information resources are important information resources in China. In particular, ensuring the security of archival information in the network environment is not only a subject of archival information work but also an important responsibility of the national information department. Once an archival information security problem occurs in the network environment, the loss will be incalculable. In developing archival information service in the network environment, we should do a good job in constructing the archival information database and handle the relationship between network archival information service and archival information security. These problems affect the level and quality of archival information service in the network environment to different degrees. The initial stage of multimedia technology has achieved great success and shows broad application prospects. But in the design of some advanced multimedia systems, the technical requirements far exceed what is available today. This shows that the further development of multimedia applications is expected to be driven by new breakthroughs in multimedia technology, and research on distributed multimedia technology is currently one of the keys. With the continuous development of information and network technology, the traditional way of file management needs to be reformed. People have become used to studying and discussing over the network, and network document archive management must adapt to this trend. Using ASP.NET development technology, this paper constructs a network file acquisition and storage system based on multimedia information technology. The document archives management system basically realizes an informatized business flow for document archives management. The completion of this work has practical significance and value for improving the automation level of document archive management in China.
The construction of document archive information service management is a dynamic development process, and the informatization of document archive management is an inevitable trend. The test results show that the interface of the multimedia information system proposed in this paper is beautiful and simple, convenient to use, low cost, and clear and smooth playback content. This paper realizes the goal of the system and all the requirements of the project, meets the application conditions of the project, and can be used widely.