Vtome.ru - электронная библиотека

  • Добавил: literator
  • Дата: 26-07-2023, 22:32
  • Комментариев: 0
Название: Delta Lake: Up and Running: Modern Data Lakehouse Architectures with Delta Lake (5th Early Release)
Автор: Bennie Haelen, Dan Davis
Издательство: O’Reilly Media, Inc.
Год: 2023-07-25
Страниц: 157
Язык: английский
Формат: epub
Размер: 10.2 MB

With the rapid growth of big data and AI, organizations are quickly building data products and solutions in an ad-hoc manner. But as these data organizations mature, it's apparent that their analysis and Machine Learning (ML) models are only as reliable as the data they're built upon. The solution? Delta Lake, an open-source format that enables building a lakehouse architecture on top of existing storage systems such as S3, ADLS, and GCS. In this practical book, author Bennie Haelen shows data engineers, data scientists, and data analysts how to get Delta Lake and its unique features up and running. The ultimate goal of building data pipelines and applications is to query processed data and gain insights from it. You'll learn how the choice of storage solution determines the robustness and performance of the data pipeline, from raw data to insights. Delta Lake brings capabilities such as transactional reliability and support for UPSERTs and MERGEs to data lakes while maintaining the dynamic horizontal scalability and separation of storage and compute of data lakes. Delta Lake is one the enablers for building Data lakehouses, an open data architecture that combines the best of data warehouses and data lakes.
  • Добавил: SCART56
  • Дата: 26-07-2023, 04:18
  • Комментариев: 0

Название: Основы проектирования приложений баз данных (2-е изд.)
Автор: Баженова И.Ю.
Издательство: М.: НОУ "Интуит"
Год: 2016
Страниц: 237
Формат: PDF
Размер: 20 Мб
Язык: русский

Курс знакомит слушателей с различными подходами используемыми при реализации доступа к источникам данных, приводится анализ существующих методов доступа к данным, включая ODBC, OLE DB и ADO, рассматриваются механизмы публикации удаленных источников данных в Интернет. В курсе приводится обзор классов, используемых для работы с базами данных, предоставляемых системами программирования Delphi, JBuilder и Microsoft VisualStudio .NET.
  • Добавил: literator
  • Дата: 25-07-2023, 12:34
  • Комментариев: 0
Название: Statistical Models and Methods for Data Science
Автор: Leonardo Grilli, Monia Lupparelli, Carla Rampichini
Издательство: Springer
Год: 2023
Страниц: 186
Язык: английский
Формат: pdf (true), epub
Размер: 18.0 MB

This book focuses on methods and models in classification and data analysis and presents real-world applications at the interface with Data Science. Numerous topics are covered, ranging from statistical inference and modelling to clustering and factorial methods, and from directional data analysis to time series analysis and small area estimation. The applications deal with new developments in a variety of fields, including medicine, finance, engineering, marketing, and cyber risk. Analyzing categorical data in Machine Learning generally requires a coding strategy. This problem is common to multivariate statistical techniques, and several approaches have been suggested in the literature. This article proposes a method for analyzing categorical variables with neural networks. Both a supervised and unsupervised approaches were considered, in which the variables can have high cardinality. Some simulated data applications illustrate the interest in the proposal.
  • Добавил: tatanavip
  • Дата: 25-07-2023, 08:52
  • Комментариев: 0

Название: Магия таблиц. 100+ приемов ускорения работы в Excel (и немного в Google Таблицах)
Автор: Ренат Шагабутдинов, Евгений Намоконов
Издательство: МИФ
Год: 2023
Формат: pdf, fb2
Размер: 73 Мб
Качество: Хорошее
Язык: Русский

Подробное и интересное руководство с новейшими инструментами, которые помогут разобраться во всех возможностях Excel и Google Таблиц.
Вас ждут более 100 функций, инструментов, нюансов и горячих клавиш, примеры из настоящей рабочей практики с нескучным сюжетом. А также бонус: ссылки на книги Excel для самостоятельной практики.
  • Добавил: SCART56
  • Дата: 25-07-2023, 07:51
  • Комментариев: 0

Название: Apache Kafka. Потоковая обработка и анализ данных
Автор(ы): Нархид Ния, Шапира Гвен, Палино Тодд
Издательство: СПб.: Питер
Год: 2019
Страниц: 320
Формат: PDF
Размер: 12 Мб
Язык: русский

При работе любого enterprise-приложения образуются данные: это файлы логов, метрики, информация об активности пользователей, исходящие сообщения и т. п. Правильные манипуляции над всеми этими данными не менее важны, чем сами данные. Если вы — архитектор, разработчик или выпускающий инженер, желающий решать подобные проблемы, но пока не знакомы с Apache Kafka, то именно из этой замечательной книги вы узнаете, как работать с этой свободной потоковой платформой, позволяющей обрабатывать очереди данных в реальном времени.
  • Добавил: literator
  • Дата: 24-07-2023, 16:35
  • Комментариев: 0
Название: Secure Data Mining
Автор: Jocelyn O. Padallan
Издательство: Arcler Press
Год: 2022
Страниц: 244
Язык: английский
Формат: pdf (true)
Размер: 10.1 MB

Data mining is a process to extract useful knowledge from large amounts of data. To conduct data mining, we often need to collect data. However, privacy concerns may prevent people from sharing the data and some types of information about the data. How we conduct data mining without breaching data privacy presents a challenge. Secure Data Mining provides solutions to the problem of data mining without compromising data privacy. This professional book is designed for practitioners and researchers in industry, as well as a secondary textbook for advanced-level students in Computer Science. Fundamentals and basic concepts regarding data mining are given in Chapter 1 which include data types, information gained from the data, and usefulness of the data mined. Chapter 2 provides detailed knowledge about the security of the data in the process of data mining. A number of approaches of security including classification and detection of data, clustering of data, intrusion detection systems etc. are discussed in this chapter. Classification approaches of the data are discussed in Chapter 3 of this book. Categorization of data and categorization techniques, preprocessing of data and feature selection are the presented in this chapter. Chapter 4 discusses the application of secure data mining in fraud detection. This chapter gives overview of the existing fraud detection systems and compares it with the secure system of fraud detection.
  • Добавил: literator
  • Дата: 22-07-2023, 22:11
  • Комментариев: 0
Data Fabric and Data Mesh Approaches with AIНазвание: Data Fabric and Data Mesh Approaches with AI: A Guide to AI-based Data Cataloging, Governance, Integration, Orchestration, and Consumption
Автор: Eberhard Hechler, Maryela Weihrauch, Yan (Catherine) Wu
Издательство: Apress
Год: 2023
Страниц: 440
Язык: английский
Формат: pdf (true), epub
Размер: 41.6 MB

Understand modern data fabric and data mesh concepts using AI-based self-service data discovery and delivery capabilities, a range of intelligent data integration styles, and automated unified data governance—all designed to deliver "data as a product" within hybrid cloud landscapes. This book teaches you how to successfully deploy state-of-the-art data mesh solutions and gain a comprehensive overview on how a data fabric architecture uses Artificial Intelligence (AI) and Machine Learning (ML) for automated metadata management and self-service data discovery and consumption. You will learn how data fabric and data mesh relate to other concepts such as data DataOps, MLOps, AIDevOps, and more. Many examples are included to demonstrate how to modernize the consumption of data to enable a shopping-for-data (data as a product) experience.
  • Добавил: SCART56
  • Дата: 22-07-2023, 10:55
  • Комментариев: 0

Название: Основы проектирования баз данных
Автор(ы): Шитов В.Н.
Издательство: Инфра-М
Год: 2023
Страниц: 237
Формат: PDF
Размер: 59 Мб
Язык: русский

В учебном пособии описаны основные понятия баз данных, взаимосвязи в моделях и реляционный подход к построению моделей, этапы проектирования баз данных, проектирование структур баз данных, организация запросов SQL и многое другое. Приведено 18 практических работ.
  • Добавил: magnum
  • Дата: 22-07-2023, 04:54
  • Комментариев: 0
Windows 10 For Beginners - 15th Edition 2023Название: Windows 10 For Beginners - 15th Edition 2023
Автор: Papercut Limited
Издательство: Papercut Limited
Год выхода: 2023
Страниц: 122
Формат: PDF
Размер: 82,5 MB
Язык: английский

Windows 10 For Beginners is the first and only choice if you are new adopter and want to learn everything you’ll need to get started with your new operating system. This independent manual is crammed with helpful guides and step-by-step fully illustrated tutorials, written in plain easy to follow English. Over the pages of this new user guide you will clearly learn all you need to know about out of the box set up, getting to grips with the more advanced features and discover a huge array of amazing apps. With this unofficial instruction manual at your side no problem will be unsolvable, no question unanswered as you learn, explore and enhance your user experience.
  • Добавил: literator
  • Дата: 21-07-2023, 19:39
  • Комментариев: 0
Название: Think Like a Data Analyst (MEAP v3)
Автор: Mona Khalil
Издательство: Manning Publications
Год: 2023
Страниц: 207
Язык: английский
Формат: pdf, epub
Размер: 17.9 MB

Learn the technical and soft skills you need to succeed in your career as a data analyst. Think Like a Data Analyst is full of sage advice on how to be an effective data analyst in a real production environment. Inside, you’ll find methods that maximize the impact of your work, from choosing the right analysis approach to effectively communicating with stakeholders. You’ll soon understand the nuances and challenges of real data science projects, with the kind of insights that only come from years of experience. Without doubt, technical skills in Python, R, SQL, along with knowledge of statistics and data science are vital to your success as an analyst. However, they’re only part of the picture. This one-of-a-kind guide reveals the soft skills, best practices, and tools that help you maximize your effectiveness and deliver accurate data-driven decisions in your organization.