Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
A data lake is a centralized repository that allows you to store vast amounts of structured, semi-structured, and unstructured data in its raw format. Unlike traditional data warehouses where data is stored in a structured manner, a data lake retains the data in its native format until it's needed for analysis or processing. ONES provide the capability to store the RAW data of all the Metrics to Cloud and then user will be able to use that RAW data for any deployment or any other use cases.
Storage of Diverse Data Types: A data lake can store various types of data, including structured data (like relational databases), semi-structured data (like JSON, XML), and unstructured data (like documents, images, videos). This flexibility allows organizations to ingest and store data from different sources without the need for extensive preprocessing.
Scalable and Cost-Effective Storage: Data lakes are typically built on scalable storage systems, such as cloud-based object storage (e.g., Amazon S3, Azure Data Lake Storage) or Splunk . These systems can efficiently handle large volumes of data and offer cost-effective storage solutions.
Schema-on-Read Approach: In contrast to traditional data warehouses that use a schema-on-write approach (where data must be structured and conform to a predefined schema before storage), data lakes adopt a schema-on-read approach. This means that data is stored in its original form, and the schema is applied at the time of data retrieval or analysis. This flexibility allows users to apply different schemas and interpretations to the same dataset based on their analytical needs.
Support for Big Data Processing and Analytics: Data lakes serve as a foundational component for big data analytics and processing. Users can perform various analytics tasks, including exploratory data analysis, data mining, machine learning, and real-time analytics, directly on the data lake. Tools like Apache Spark, Apache Hive, and Presto are commonly used for querying and processing data stored in data lakes.
Support for Data Discovery and Self-Service Analytics: Data lakes enable data discovery and self-service analytics, empowering users to explore and analyze data without extensive dependencies on IT teams. Data scientists, analysts, and business users can access relevant data directly from the data lake, speeding up insights generation and decision-making processes.
In summary, ONE DL provides a flexible and scalable platform for storing, managing, and analyzing diverse data types at scale. By leveraging a schema-on-read approach and supporting various analytics tools, ONES DL facilitate advanced data analytics and enable organizations to derive valuable insights from their data assets. However, proper governance, security, and metadata management are crucial to ensure the usability, reliability, and integrity of data lakes.
As of now ONES support 2 different platforms where customer can get the RAW data
Splunk
Amazon S3
Users will have the option to tune the frequency of streaming the metric to the cloud platform, user will have the option to tune frequency starting from 1 minute to 60 minutes.
Users can select/unselect the Network state metrics using the above catalogue option
In data center operations, a rule engine with alerts for various metrics is essential for proactive monitoring and management of critical components and services. Let's discuss the need for rule engine alerts for specific metrics in a data centre environment
CPU and Memory Utilisation
Fan and PSU LED status
SSD Memory Utilization, Health and Temperature Status
Traffic Bandwidth
ASIC Routes
Health Services
Device Down alerts
BGP Neighbour Down alter
Component failure
Interface Flap Alerts
Traffic Errors and Discard Counters
PFC Counters
Device Queue Counters
Rule engine alerts ensure efficient resource utilization, timely troubleshooting, early detection of potential issues, and overall operational stability within the data centre environment.
ONES-App is capable of triggering breached threshold values to
Slack Channel
Zendesk Support
ServiceNow
Device Level
Interface Level
Hierarchy
Metrics
Unit
Measure
Value
Device
CPU Utilization
Percentage (%)
AVG/MIN/MAX
0/100
Device
Memory Utilization
Percentage (%)
AVG/MIN/MAX
0/100
Device
Failed Fans
Count ()
MIN/MAX
Count
Device
Failed PSU
Count ()
MIN/MAX
Count
Device
CPU Core Temperature
Celsius ()
AVG/MIN/MAX
Celsius
Device
PSU Temperature
Celsius ()
AVG/MIN/MAX
Celsius
Device
FAN Speed
Percentage (%)
AVG/MIN/MAX
0/100
Device
ASIC IPv4 Routes Utilization
Percentage (%)
AVG/MIN/MAX
0/100
Device
ASIC IPv6 Routes Utilization
Percentage (%)
AVG/MIN/MAX
0/100
Device
BGP Nbrs Operationally Down
Count ()
AVG/MIN/MAX
Count of Nbrs
Device
FRR Container CPU Utilization
Percentage (%)
AVG/MIN/MAX
0/100
Device
Syncd Container CPU Utilization
Percentage (%)
AVG/MIN/MAX
0/100
Device
Device Down
NA
NA
NA
Device
Queue Counter
Count()
AVG/MIN/MAX
Count
Device
SSD Health
Percentage(%)
Percentage(%)
0/100
Device
SSD Temperature
Celsius ()
AVG/MIN/MAX
Celsius
Device
SSD Memory
Percentage(%)
Percentage(%)
0/100
Interface
Int Flap
NA
NA
NA
Interface
PFC Counters
Count ()
AVG/MIN/MAX
Count
Interfaec
Queue Counters
Count ()
AVG/MIN/MAX
Count
Interface
TX Utilization
Percentage (%)
AVG/MIN/MAX
0/100
Interface
RX Utilization
Percentage (%)
AVG/MIN/MAX
0/100
Interface
In Errors
Count ()
AVG/MIN/MAX
User defined
Interface
Out Errors
Count ()
AVG/MIN/MAX
User defined
Interface
In Discards
Count ()
AVG/MIN/MAX
User defined
Interface
Out Discards
Count ()
AVG/MIN/MAX
User defined
Interface
Tranx TX Power
dBm
AVG/MIN/MAX
User defined
Interface
Tranx Rx Power
dBm
AVG/MIN/MAX
User defined
Interface
Tranx Temperature
Celscius ()
AVG/MIN/MAX
User defined
Interface
Tranx Voltage
Volts ()
AVG/MIN/MAX
User defined
Password based agent installation
ONES-Multisite Package
Server_GPU Telemetry Package
New Topology Page:
DataCenter Type
AI-ML Type
Server GPU
Ticketing Tool for Alerts
ServiceNow
QOS Page
Server Health metrics
GPU Based metrics
AI Fabric Telemetry add-on
AI Fabric configuration
RoCE configuration
New Rules added for Server GPU Telemetry
Open Networking Enterprise Suite (ONES) is a Network Orchestration, Visibility, and Assurance solution for multi-vendor and multi-NOS operated Network Infrastructure. ONES provides a one-stop solution from delivering deep visibility into your datacenter networks to extending 24x7 support functions for SONiC. It also hosts a powerful analytics engine that assists users to identify network issues and troubleshoot their networks, in case of common network anomalies and disruptions.
ONES uses Auto-discovery for SONiC devices and a YAML or CSV-based template for adding non-SONiC devices during the onboarding process and continuously collects streaming telemetry data from them to provide insights on:
Data Center Inventory
Network State
Platform and System Health
Control and Data Plane resource Utilisation
Traffic Utilisation
Software Compliance
AI Fabric
Underlay and Overlay protocols view
ONES monitors various control and data plane metrics to provide these insights.
ONESv3.0 application has the capability to trigger notifications via Slack app notifications when certain user-defined threshold values are breached.
In data centre operations, a rule engine with alerts for various metrics is essential for proactive monitoring and management of critical components and services. Rule Engine pushes the configured rule notification in case any device breaches the threshold value configured under the rule to SLACK Channel, Zendesk Support and ServiceNow Ticketing Service integration
Let's see the different types of rule engine metrics for specific Entity/features in a data centre environment
CPU and Memory Utilisation
Fan and PSU LED status
Traffic Bandwidth
ASIC Routes
Health Services
Traffic Errors and Discard Counters
BGP Neighbours flapping notification
Device down status
Link flap status
Device SSD Memory Utilization, Health and Temperature
ROCE Counters
ONES orchestration provide network admins to automate the fabric configuration using configuration templates for provisioning physical interfaces, layer 3 configuration for building IP-CLOS fabric using
BGP as a routing protocol including BGP-unnumbered
Symmetric/Asymmetric IRB
BGP Peering with PO
L2/L3 MC-LAG
EVPN MultiHoming
Layer2 Leaf-Spine (L2/L3 Mode)
Rack-to-Rack Deployment
BGP Peering over MC-LAG PeerLink
BGP Peering using separate Link between MC-LAG Peers
SFLOW
DHCP Relay
RoCE Config
AI Fabric config
SAG / SVI
NTP, SNMP, SYSLOG
Incremental Config update for L2VNI/L3VNI
Enhanced backup and restore options via UI
Enhanced API support - Config Replace
ONES orchestration not only configures the fabric but also make sure the Fabric is operational by doing verifying the configuration at every stage.
ONES provides north bound API access for configurations originating from external orchestration tools.
A data lake is a centralized repository that allows you to store vast amounts of structured, semi-structured, and unstructured data in its raw format. Unlike traditional data warehouses where data is stored in a structured manner, a data lake retains the data in its native format until it's needed for analysis or processing. ONES provide the capability to store the RAW data of all the Metrics to Cloud and then user will be able to use that RAW data for any deployment or any other use cases.
Storage of Diverse Data Types: A data lake can store various types of data, including structured data (like relational databases), semi-structured data (like JSON, XML), and unstructured data (like documents, images, videos). This flexibility allows organizations to ingest and store data from different sources without the need for extensive preprocessing.
Scalable and Cost-Effective Storage: Data lakes are typically built on scalable storage systems, such as cloud-based object storage (e.g., Amazon S3, Azure Data Lake Storage) or Splunk . These systems can efficiently handle large volumes of data and offer cost-effective storage solutions.
Schema-on-Read Approach: In contrast to traditional data warehouses that use a schema-on-write approach (where data must be structured and conform to a predefined schema before storage), data lakes adopt a schema-on-read approach. This means that data is stored in its original form, and the schema is applied at the time of data retrieval or analysis. This flexibility allows users to apply different schemas and interpretations to the same dataset based on their analytical needs.
Support for Big Data Processing and Analytics: Data lakes serve as a foundational component for big data analytics and processing. Users can perform various analytics tasks, including exploratory data analysis, data mining, machine learning, and real-time analytics, directly on the data lake. Tools like Apache Spark, Apache Hive, and Presto are commonly used for querying and processing data stored in data lakes.
Support for Data Discovery and Self-Service Analytics: Data lakes enable data discovery and self-service analytics, empowering users to explore and analyze data without extensive dependencies on IT teams. Data scientists, analysts, and business users can access relevant data directly from the data lake, speeding up insights generation and decision-making processes.
In summary, ONE DL provides a flexible and scalable platform for storing, managing, and analyzing diverse data types at scale. By leveraging a schema-on-read approach and supporting various analytics tools, ONE DL facilitate advanced data analytics and enable organizations to derive valuable insights from their data assets. However, proper governance, security, and metadata management are crucial to ensure the usability, reliability, and integrity of data lakes.
Orchestration refers to tasks or actions required to achieve a set of objectives for your Network Infrastructure operations
A centralized application like ONES translates these objectives into a network configuration template, applies and monitors to validate the operational efficiency and functionality
Automated tasks are performed on your Network Fabric in a purposeful order and each step is verified for success before moving to the next
ONES Orchestration function, referred to as Fabric Manager (FM), lets you compose, deploy, and validate network configurations across any SONiC, be it a Community version or a Vendor distro.
As part of the initial release, ONES Orchestration supports to
Create and configure CLOS topology for ToR, Leaf, Spine, and Super-Spine layers
Apply and validate configurations pre- and post-deployment
Compare running configs against applied configs at any point
Upgrade devices with a single click via ZTP or custom NOS images
Restore & Backup configuration feature
Yaml-based config for VXLAN, MCLAG, BGP IP CLOS & EVPN(L2VPN), EVPN Multihoming, L3 EVPN Symmetric IRB, L3EVPN symmetric IRB with MCLAG.
RoCE configuration parameters supported
Automate Configuration of interfaces, layer 3 interfaces, BGP-unnumbered and Common Services like NTP, SNMP, SYSLOG etc.
ONES Orchestration use cases are configured using a set of pre-defined YAML-based templates on ONES Web User Interface
FMCLI is an Industry standard Command Line Interface
Once the user installs Orchestrator Agent (Fabric Manager Agent) on the device, it enables FMCLI
FMCLI provides a user interface to configure all the open standard protocols and is user-friendly
To use FMCLI, the user can run fmcli command on the device to enter in the configuration mode and can configure the protocols or any other required feature
Example of BGP config using fmcli
NetOps API can be used to integrate into customer-running applications, and can be used to perform the Day 1 and Day N configuration, Using NetOps API a user can do all the configurations and can also perform the Partial Configuration
Day-1 Operations: intent upload
SONiC NOS upgrade
Device Reboot
Difference between the Golden Config and running configuration //running & applied configuration
Backup Running Configuration
Replace Config
If config is done by the ONES then only the user can use this replace option to modify the config.
ZTP Upgrade
ONES Telemetry Collector(s) and Analytics bring truly unparalleled visibility across all your switches running SONiC (both community and vendor distros), regardless of the underlying ASIC. ONES front end (UI), will enable network admins to;
Manage inventory of your network devices running SONiC on Broadcom, Cisco, Marvell, and Nvidia ASICs
View the topology of the entire fabric across multiple hardware platforms, and network operating systems
Monitor traffic, system health, bandwidth utilization, & more
Topology page can be filter as per Datacenter or AI-ML type topology
This page shows Underlay, Overlay, RoCE telemetry view & other Advance Filtering view
The same Topology page allows a user to connect to the device SSH and Console
Enhanced Traffic Page shares the PFC enabled Interfaces
Protocol page shows new metrics of VXLAN, LACP, MCLAG information and QoS related metrics.
Track Switch CPU/memory consumption, bandwidth, link failures, traffic errors, and more in real-time
Proactively identify and resolve issues that may lead to network downtime
Instantly connect to individual devices for maintenance and troubleshooting
Syslog extraction for device, Console access, Add/Remove Non-SONiC devices via YAML or CSV, export or download inventory
Firmware information is added in Device details section
Network Telemetry
Topology visualization
Inventory (Hardware & System, PSU, Fan, SSD etc health)
Control Plane (BGP, LACP, MCLAG, VXLAN)
Data Plane (ASIC Tables Utilization)
Links (Ports Inventory, Transceivers, Flaps, Cabling)
Traffic (In/Out Packets, Errors, Queue Counters, RoCE Visibility)
Support
Image Management (ZTP, Custom)
Access & Management (SSH, Console, SYSLOGs, Reboot)
Network Telemetry
Topology visualization
Inventory (Hardware & System, PSU, Fan, SSD etc health)
Control Plane (BGP, LACP, MCLAG, VXLAN)
Data Plane (ASIC Tables Utilization)
Links (Ports Inventory, Transceivers, Flaps, Cabling)
Traffic (In/Out Packets, Errors, Queue Counters, RoCE Visibility)
Support
Image Management (ZTP, Custom)
Access & Management (SSH, Console, SYSLOGs, Reboot)
Orchestration
Intent-Driven Orchestration for IPCLOS, L2LS, EVPN-VXLAN, EVPN-Multihoming
Policies, System Config & QOS
Day 2 Operations (Incremental Config etc) with API
Config Generate, Validate, Apply & Diff
Integration & Alerting
Rule Engine with alerting on metrics
Integration with Slack, Zendesk and ServiceNow.
ONE-DL
Network Performance Monitoring
Packet Loss & Latency
Network Assurance helps the NetOps team validate policy and security compliance checks before making a change in network configuration, an intelligent set of proactive and predictive techniques that validate the Network for readiness without error, conflicts, and disruptions
Aviz Support team is located across four timezones offering 24x7 SONiC and related product support for multi-vendor switches and ASICs. Using our support portal, we offer you to
Collaborate with our SONiC experts to expedite your evaluations
Speed up your SONiC troubleshooting SLAs to as low as 15 minutes regardless of the underlying Switch/ASIC platform
Minimize operational delays by centralizing issues across multiple platforms
Supports Options are available:
Integrated Chat
Submit a Ticket
Send an email to support@aviznetworks.com
To connect with customer support users can choose the support option available on ONES-UI
ONES is a support application for SONiC stack. It is designed for customer's engineering team such as SRE’s, HW and SW engineering teams for their daily network diagnosis and troubleshooting needs. In addition to that ONES exposes the API to integrate with external tools or customer homegrown applications.
ONES application provides HTTPs over standard port 443 supporting both self-signed and CA signed certificates.
HTTPS Support CA Signed
HTTPS Self Signed
ONES utilizes gRPC infrastructure to communicate with switch agents. TLS (Transport Layer Security) is the primary security protocol used by gRPC to secure communication between the client and the server. TLS provides authentication, confidentiality, and integrity of data. Authentication is achieved using digital certificates, which verify the identity of the client and the server.
With an added extra layer of security, ONESv3.0 support Certificate based communication between switches and ONES Controller, and all the metrics will be streamed using the certificate-based encryption
ONESv3.0 allows a user to use password based authentication between Controller and Switches, all the communication can be done by the implementation of password based agent.
Transport Layer Security (TLS) is a crucial protocol that ensures secure communication between ONES Controller and Agent, Whenever Agent will register to ONES server and further it will start sending the update it will encapsulate all the metrics and will do the encryption based on certificate provided at the time of installation, by using this all the communication will be encrypted between ONES agent and ONES controller TLS relies on digital certificates issued by trusted Certificate Authorities (CAs) to authenticate servers and sometimes clients. These certificates validate the identity of the entities involved in the communication and establish trust in the encrypted connection.
Password-based authentication can be implemented between the agent and controller, allowing all devices to register using a shared password. This method relies on OpenSSL for encrypting the communication, ensuring secure transmission of credentials and data. OpenSSL employs robust encryption algorithms to safeguard the authentication process and prevent unauthorized access.
One single solution to monitor all the data centre availability and one-click access to sites.
The ONES rule engine enables the detection of incidents and the generation of alerts, but this information is limited to the specific site managed by the corresponding controller. While this data is valuable for data center administrators to address and resolve issues, enterprise-level administrators or executives interested in the overall health of the entire network must access individual ONES instances at each data center to view the status. This process can be inefficient and cumbersome.
To solve this, ONES 3.0 Multisite provides a solution by offering a geospatial representation of anomalies across multiple sites, spread over different geographical locations, giving a comprehensive view of network health. ONES instances from data centers around the world can register with a central ONES 3.0 Multisite application. Once registered, the multisite system regularly polls each site for key data, such as the number of managed devices (endpoints) and critical alerts, and presents this information on a map, displaying individual sites and their last contact times. Additionally, ONES Multisite allows users to log in to individual data centers to access more detailed information if needed.
Green blink : Site is reachable with no critical alerts
Red blink : Site is reachable but there are critical alerts
Red : Site is not reachable
Time Series
Average between 2 data points
Data Points
Reference
1Hour
30sec
120
2Hours
40sec
180
4Hours
1Minute 20sec
180
12Hours
5Minutes
144
1Day
10Minutes
144
1Week
1Hour 10Minutes
144
2Weeks
2Hours 20Minutes
144
For more Details on NetOps API check
Users can reach out to customer support on
Refer to the "" section of this document for more details
Role Based Access
ONES provide RBAC support for creating dedicated user accounts. it has a superadmin account which can manage these user accounts for control and permissions
Secure Access to Application
ONES Application provides HTTPS over standard port 443 supporting both self-signed and CA-signed certificates
Secure Access to switches
Auto-discovery communication between Agent and collector using a secure channel(SSL/TLS) with certificates (self-signed and CA-signed certificates
API Access
ONES Application provides HTTPS over standard port 443 supporting both self-signed and CA-signed certificates, the API is available via time-bound authentication tokens.
1G
Accton (AS4625, AS4630), Celestica (DS1000), Supermicro (G3748), Micas(M2-W6510-48GT4V)
10/25G
Accton (AS5812, AS5835, AS7326), Celestica(DS2000), Dell (5212, 5248), Micas(M2-W6510-48V8C)
100G
Accton (AS7712, AS7726, AS7816), Celestica (DS3000), Dell(5232)
400G
Accton (AS9716), Arista (7060), Dell (Z9332)
800G
Celestica (DS4101)
100G
8101-32H, 8102-64H
400G
8101-32FH
1G
Wistron (ES1227, ES2227)
400G
Wistron (6512)
1G
SN2201
10/25G
SN2010, SN2410, SN3420
100G
SN2100, SN2700, SN3700C, SN3800, SN4600C
400G
SN4410, SN4700
800G
SN5600
Cumulus Linux, Arista EOS & Cisco NX-OS platforms are considered by ONES as Agent-less and supports metrics available using NVUE and EOS APIs
Arista
EOS
4.x
Cisco
NXOS
9.x
NVIDIA
Cumulus Linux
5.9, 5.11
SONiC-based switches require ONES Agents (Agent-based) to be installed on the switch being monitored, as a pre-requisite for ONES Telemetry and orchestrator-based functions to work.
ONES Telemetry Agent
ONES Orchestrator Agent
NX-OS expose its own way of metric collection using GRPC
ONES does not support Orchestrator-based functions on Proprietary NOS (non-SONiC).
SSH access
SONiC versions beyond 202012 or 202111 are supported
Only x86 intel-based architectures are supported
Proprietary NOS like Arista EOS, Cumulus, and Cisco NX-OS does not require an ONES Agent and instead leverage the (Agent-less) feature. OpenConfig extends APIs that provide Network Telemetry information about the resources being monitored via (gRPC Network Management Interface) protocol to the ONES Application
ONES Function
Device/Switch Count
Telemetry for Visibility
1024
Orchestration
1024
ONES Installation follows the below steps in the order sequence of:
License Readiness
Preparing and Installing ONES Application machine
Installing ONES Agents on SONiC Switches for Orchestrator and Telemetry
Enabling OpenConfig on non-SONiC Switches for Telemetry
After installation of ONES application user needs to contact AVIZ support to generate a trail license, trail license works for 8 devices and upto 30 days a customer can use.
ONES Application License (As per device count)
Telemetry Visibility (Monitor, inventory, analytics)
Add/remove Non-SONiC devices via YAML or CSV
Support (Zendesk)- Ability to open a ticket with inventory dump
Syslog access, Console/SSH access for device
To obtain a license, contact support@aviznetworks.com providing the below details;
License Duration - In Years (1-5)
Devices Count - 8, 16, 32, 64, 128, 256, 512 or 1024
ONES installation ID
Email ID: (For Account creation)
In the current release, ONES can support managing up to 1024 devices. For ONES Application Installation, the system hardware requirements vary based on the number of devices to manage;
8
x86/x64 based,
16-core CPU
32GB
160GB or more
16
x86/x64 based,
16-core CPU
64GB
320GB or more
32
x86/x64 based,
16-core CPU
128GB
640GB or more
64
x86/x64 based,
32-core CPU
256GB
1.2TB or more
128
x86/x64 based,
32-core CPU
256GB
3 TB or more
256
x86/x64 based,
32-core CPU
512GB
6 TB or more
512
x86/x64 based,
64-core CPU
1TB
12 TB or more
1024
x86/x64 based,
64-core CPU
1TB or more
20 TB or more
Ubuntu 18.0 or later
docker, docker-compose
python3, python3-pip
paramiko
scp
Ubuntu Server
Installer file (Version 18 or higher)
lsb_release -a
Update to latest packages
sudo apt-get update
NA
Install Docker
sudo apt-get install docker.io
docker ps
Install Docker-compose
sudo apt-get install docker-compose
docker-compose version
Install Python3
sudo apt-get install python3
python3 –-version
Install Python3-pip
sudo apt-get install python3-pip
pip3 –-version
Install Paramiko
sudo apt-get install python3-paramiko
pip show paramiko
Install SCP-Client
sudo pip3 install scp
pip show scp
These ports has to be enable on ONES Controller
ONES Collector
50053
These ports has to be enable on Device(Switch)
Switch Access over SSH
22
ONES Monitoring
50052
gNMI Gateway (Telemetry)
9339
ONES Telemetry Database
5432
ONES Orchestrator
8787
ONES Orchestrator Database
2345
pty-server
8885
API-Server
8080
stream-processer
8093
ksqldb-server
8088
kafka-connect
8083
schema-registry
8081
broker
29092, 9101, 9092
Zookeeper
2181
ONES Collector
50053
HTTPS port has to be enable if a firewall is present in between User-machine and ONES-Controller
ONES Web GUI
443
SSH is enabled
Network Reachability from ONES Application
OpenConfig feature is enabled (for non-SONiC Switches)
Google Chrome version 107 or later
Mozilla Firefox version 106 or later
Ubuntu 18.0 or later can be used for Multisite app
ONES Multisite has to be installed on separate server
ONES Multisite use TCP port 443
Reachability between ONES sites and Multisite instance.
ONES requires user to install the below agents on SONiC NOS to allow Network Orchestration and Visibility
ONES Orchestrator Agent for Network Orchestration
ONES Telemetry Agent for Telemetry Data Streaming (Network Visibility)
ONES Server_Agent for GPU Telemetry Data Streaming
ONES Agent 3.0 Version allow the user to add a new controller IP and to change the authentication between controller and agent without doing the complete installation again.
NOTE: for non-SONiC switches,
OpenConfig feature on its NOS needs to be enabled for Network Visibility (Telemetry Data Streaming)
Network Orchestration is not supported
SONiC NOS Upgrade could be done either via
ONES UI (Inventory-->Devices)
Instead of using FM - Orchestrator Agent
Orchestrator Agent takes a backup of FMCLI, ONES Agents and associated services to the /host
folder.
After a successful upgrade, Orchestrator Agent restores these files
Traditional method (ZTP, sonic-installer CLI)
The user needs to reinstall ONES Agents again
Cumulus Networks offers the NVUE (Network Virtualization Utility Engine) API, providing an abstraction layer over traditional configuration mechanisms. This allows for a more intuitive and standardized approach to network configuration and management, echoing modern software development practices.
Arista's EOS platform leverages OpenConfig, a collaborative effort among network operators to define vendor-neutral data models for configuring and managing networks. OpenConfig facilitates simplified, consistent interactions across different network devices.
Cisco's NX-OS supports gRPC, enabling efficient, scalable, and programmatic network device management. This interface allows for the streaming of telemetry data and the execution of configuration commands.
These steps will guide a user how to install a tar-ball package on base Linux based machine
ONES Application system - Installation Steps
Enable super-user mode
user@ones-application:~$ sudo su
Extract the contents of the downloaded ONES Installer tar.gz file
root@ones-application:~$ tar -xvf ONES-3.0.0.tar.bz2
Go to ONES-3.0 folder on the server machine
root@ones-aplication:~$ cd ONES-3.0
Run ones-installer.sh to Install the ONES application function
root@ones-application:~/ONES-3.0$ ./ones-installer.sh
While upgrading there is no dependency of prevision version files, Once the upgrade process is completed, user manually have to delete the previous version files/Packages from the device, Script do not touch old version files
By default, the installer does not provide any license, user needs to contact AVIZ support Team.
ONESv3.0 support SSL certificate integration
User can choose YES if the User wants to integrate their own SSL certificate
Note* Replace the Private key & certificate path to correct location
Choose No, if the user wants to use a self-signed certificate that is integrated into ONES package
Enable or disable the ONE DL feature
User can choose YES if the User wants to enable ONES DataLake feature.
User can choose No if need not to enable ONES DataLake feature
The installation allows a user to enable DB backup
Users can choose the local or remote location to backup the database
By default, the application creates a database backup every 86400 seconds(1 day), but the user can modify it as per the requirement
Local and Remote Backup
By-default application creates 1 backup for local and for remote,
It has a range of 1 to 3 and once it will add one more it will remove the first copy of the database,
User can modify the number of backup files at the time of installation
By-default ONES create ./backup directory to maintain local database
ONESv3.0 support certificate-based authentication between ONES App and devices for GNMI and Auto-discovery
For agent auto-discovery agent will act as a client and the collector as a server. For normal gnmi communication, the agent will act as a server and the collector as a client. Need certificates based on this.
The user needs to provide the certificate path and replace the key name with the path of the certificate to be used here
ONES Application support IP-based Access & FQDN Access
IP based
FQDN based
ONESv3.0 support password-based(MD5) authentication between ONES App and devices for GNMI and Auto-discovery
User can choose yes if need password based authentication
User can choose no if there no requirement of authentication
Users cannot simultaneously select both certificate-based authentication and password-based authentication between the agent and the ONES application. They must choose one authentication method at a time for securing communication between the agent and the ONES app if required.
Installation begins
Access ONES Application Web GUI from a supported browser using https://<host-ip/FQDN>
For a Trail license, the user needs to reach out to AVIZ Support
For Activation, the user can choose Activate License if the user has an activation key of any subscription
Use Default credentials as below;
Username: superadmin
Password : Admin@123
Update/Change your password on the first login
After Resetting the password use new credentials to login
You will see the default Monitor Page with a Topology view
After Trail use, if the user wishes to upgrade new Subscription-based license, so user can navigate to the below page.
Account >> License >> Upgrade License
Click Upgrade License & Enter the subscription-based key
Copy the postgresql.config file outside to do changes
edit the file and uncomment few lines to activate Large config and save the changes
Copy the changes to DB config file
Restart the ones-collector-db
User can download the Latest version of ONES on Support Portal.
Work with Aviz Sales/Support contact to create an account on Aviz Networks Support Portal
Click on the Downloads section, under ONES, click to download ONES Release 3.0
File to install on Ubuntu
ONES 3.0 Installation package
File to install on VMware, ESXI, vSphere, and vCenter setup
ONES 3.0 OVA
File to install on Hypervisor, KVM, LibVirt Manager
ONES 3.0 QCOW2
To download ONES Multisite application
ONES 3.0 Multisite
To download ONE-DL
ONE-DL 1.0
ONESv3.0 Agent support auto-discovery feature
ONESv3.0 Agent support to send telemetry on multiple controllers (Max 2)
Restrict IP feature can be enabled/disabled
Only Collector upgrade after Deployment is possible now
Password-based authentication is supported
Using the Restrict IP feature agent will discover the ONES Controller and will update the entry on the ONES App with all the feature metrics
Need to add a few inputs while installing the agent
Controller IP //To restrict the telemetry streaming
Device Credentials
Layer
Region
type
groupid
azid
brickid
rackid
On the Application machine, go to ONES-3.0/ones_t_agent folder
root@ones-application:~$ cd /ONES-3.0/ones_t_agent
Enter device details (Management IP, Username and Password ) in device_info.csv
root@ones-application/ONES-3.0/ones_t_agent:~$ vi device_info.csv
The user must maintain the layer names exactly as specified above (case-sensitive). If the user inputs names that differ from these, they may encounter issues when using the ONES application.
Save the File
Executing the installation script can be used for installing a telemetry agent on one or more devices in the data centre.
root@ones-application/ONES-3.0/ones_t_agent:~$ ./ones_agent_parallel_installer.sh
Users can use ONES-Agent as an integrated service in SONiC OS or can use it as an independent third-party container.
Users can choose this option to only update one more controller IP without doing the complete agent installation.
Scripts asks to put the Controller IP to use auto-discovery feature
User can choose the restriction to send telemetry to collector IP only
It's important to restrict collector IP as No in case the running network has NAT translation from private to public IP for ONES server access from the device.
User can enable password based authentication between agnet and collector
The user needs to make sure, The devices have a unique name, otherwise, there will issue while plotting the full topology view(Topology Page).
ONESv3.0 Agent support auto-discovery feature
ONESv3.0 Agent support to send telemetry on multiple controllers (Max 2)
Restrict IP feature can be enabled/disabled
Password-based authentication is supported
Using the Restrict IP feature agent will discover the ONES Controller and will update the entry on the ONES App with all the feature metrics
Need to add a few inputs while installing the agent
On the Application machine, go to ONES-3.0/ones_t_server_agent directory
root@ones-application:~$ cd /ONES-3.0/ones_t_server_agent
Enter device details (Management IP, Username and Password ) in device_info.csv
root@ones-application/ONES-3.0/ones_t_server_agent:~$ vi device_info.csv
The user must maintain the layer names exactly as specified above (case-sensitive). If the user inputs names that differ from these, they may encounter issues when using the ONES application.
Save the File
Executing the installation script can be used for installing a telemetry agent on one or more devices in the data centre.
root@ones-application/ONES-3.0/ones_t_server_agent:~$ ./ones_agent_parallel_installer.sh
Users can update only password in case server is already having agent running.
Users can choose this option to only update one more controller IP without doing the complete agent installation.
Scripts asks to put the Controller IP to use auto-discovery feature
User can choose the restriction to send telemetry to collector IP only
It's important to restrict collector IP as No in case the running network has NAT translation from private to public IP for ONES server access from the device.
User can enable password based authentication between agnet and collector
user can choose no if user do not want to use password based authentication between agnet and collector.
The user needs to make sure, The devices have a unique name, otherwise, there will issue while plotting the full topology view(Topology Page).
Using this section user can install ONES-DL backend AWS
For efficient event ingestion, it's crucial to select the right EC2 instance type based on your event volume:
200K events/sec: Start with a minimum of an m5.4xlarge
instance.
Up to 400K events/sec: Scale up to an m5.8xlarge
instance.
Up to 600K events/sec: Opt for an m5.12xlarge
instance.
For higher volumes: Continue scaling in similar increments.
Copy the ONE-DL package to AWS Instance
Untar the ONE-DL package
Modify the .env file with given input
Run ./one-dl-multitenant-installer.sh
On the ONES Application server, go to ONES-3.0/ones_fm_agent
root@ones-application:~$ cd /ONES-3.0/ones_fm_agent
Enter device details (Management IP, Username, Password ) in device_info.csv
root@ones-application:/ONES-3.0/ones_fm_agent:~$ vi device_info.csv
Save the file
Execute the installation script
All these supported methods can be implemented by a single script
Installation: install FMCLI
uninstallation: Uninstall FMCLI
installresetconfig: Install FMCLI and reset the configuration
upgrade: Upgrade FMCLI to the latest version
install_withQOS: Install FMCLI and enable QoS feature
installresetconfig_withQOS: Install FMCLI, reset the configuration, and enable QoS feature
root@ones-application:/ONES-3.1/ones_fm_agent:~$ ./deploy_fmcli.sh
root@Switch:~$ sudo fmcli
root@ones-application:/ONES-3.0/ones_fm_agent:~$ ./deploy_fmcli.sh
Please refer to the link for downloading latest version of ONES Application
NOTE: You are required to sign-up on for getting access to the download page.
Login to with your account credentials
Device
GPU
GPU VENDOR ID
GPU DEVICE ID
NIC VENDOR ID
Intel Gaudi
GAUDI2
HABANALABS (0x1DA3)
GAUDI2 (1020)
HABANALABS (0x1da3)
Intel Gaudi
GAUDI2_HL2000
HABANALABS (0x1DA3)
GAUDI2_HL2000 (1010)
MELLANOX TECHNOLOGIES(0x15b3)
ONES Application can be integrated in the network as a Virtual Machine(VM) Package
QCOW2 Package: Qcow2 can be imported any KVM Hypervisor based application
OVA/OVF Package: OVA can import in
VMware workstation/Fusion
ESXI Server
Virtual-Box
QCOW2 & OVA, both packages are supported for an upgrade to latest version
ONES provides the following subscriptions to manage and monitor the devices.
8
Support up to 8 devices
16
Support up to 16 devices
32
Support up to 32 devices
64
Support up to 64 devices
128
Support up to 128 devices
256
Support up to 256 devices
512
Support up to 512 devices
1024
Support up to 1024 devices
Free(30 Days Trial)
Support up to 8 devices
SONiC Trail License
SONiC Support
SONiC NetOps (Includes SONiC Support)
ONES Agent configuration file allows user to add new collector(controller) after the agent installation if required
To redirect agent telemetry data to a different ONES collector without reinstalling the agent, simply re-run the script with the "Only controller IP addition" option enabled. This process automatically registers the device with the new ONES application and starts the telemetry data stream.
Upon upgrading the Controller IP, it will automatically register with the new ONES
These steps will guide you on how to install the ONES-Multisite
ONES Multisite Application - Installation Steps
Enable super-user mode
user@ones:~$
sudo su
Extract the contents of the downloaded ONES Installer tar.gz file
root@ones:~$
tar -xvf ones-multisite-v3.0.0.tar.bz2
Go to ones-multisite directory on the server machine
root@ones-aplication:~$
cd ones-multisite
Run untar the installer packet to Install the ONES Multisite function
root@ones:~/ones-multisite$
tar -xvf ones-multisite.tar
Load the ONES Multisite docker images
root@ones:~/ones-multisite$
docker load -i ones-multisite.tar
Run Docker service for ONES-Multisite Application
root@ones:~/ones-multisite$
docker-compose up -d
SONiC Devices use auto-discovery
Non-SONiC devices needs to add using YAML editor or using CSV file
Deep Telemetry for ASIC and Switch Hardware
Server_GPU telemetry
Device Inventory details on
Network Operating System (NOS)
Firmware versions - ONIE, BIOS, and CPLD
Hardware SKU, Model, ASIC, and Serial Number
Platform Components – Fan, PSU, Sensors
Link/Interface Health – Speed, Connectivity, Transceivers/Cables
Inventory Operations
Adding/Removing devices using YAML or CSV file
Agent Status Monitoring
Device Monitoring
Device Up/Down State based on Agent and Agent-less
Region and Zone Mapping
Device Roles – Access, Leaf-Spine, Super-Spine
Device Storage monitoring
SSD Temperature
SSD health
SSD memory
Network Compliance with version checks on
Telemetry Agent
Orchestrator Agent
ONIE, NOS, and Linux Distros versions
Resource Trends
CPU and Memory Utilization
PSU and Fan Readings
ASIC Capacity for Routes and ACLs
Software and Kernel Route capacity
Packet Counters – IN/OUT, Errors/Discards
Topology View
Datacentre and AI-ML Topology
Device Connectivity view across Roles and Location
Link/Connectivity Status
Device or Component failure count
Routing Protocol
BGP 2 Byte and 4 Byte AS
BGP Neighbors
Advertised and Received Prefixes
Local AS Number
VXLAN
MGLAG
LACP
RoCE
QOS
Orchestrator Use Cases
YAML-based Configuration push
Image Management via ZTP
BGP Numbered(IPv4 & IPv6) and Unnumbered Configuration
BGP Peering with Port-Channel
NTP, SNMP, SFLOW, and SYSLOG Configuration
VXLAN
Symmetric/Asymmetric IRB
L2/L3 MC-LAG
EVPN MultiHoming
Layer2 Leaf-Spine (L2/L3 Mode)
Leaf only Deployment
BGP Peering over MC-LAG PeerLink
BGP Peering using separate Link between MC-LAG Peers
DHCP Relay
SAG / SVI
QOS
Licensing
Application License
Telemetry Agent License
Orchestrator Agent License
User Management
Add/Edit/Delete User
Role Management
API Access for configurations originating from External Orchestration Tools
Rule Engine
Slack Channel for push notification
Zendesk ticket generation
Service Now ticket alerts
Rules status
Cloud Integration for DL
Splunk
Amazon S3
ONES allows users to leverage pre-defined templates, and customize them for Ports, IPv4/IPv6 Routes, BGP-Unnumbered, and Switch Services (NTP, SNMP, SYSLOG, ZTP, etc.) functions
The user needs to make sure, The devices have a unique name, otherwise, there will issue while plotting the full topology view(Topology Page).
Agent-based devices auto-discover the ONES-App and get registered automatically on the ONES Inventory page
To Onboard the Agent-Less devices user needs to add them manually
Navigate to Inventory
This page gives the control to onboard the devices with two options
Add Devices using the YAML Editor
Upload the CSV file containing the device list
The movement user chooses CSV upload, then the YAML Editor will be disabled
Click on Add Devices
Upload Device Inventory using YAML Editor
Navigate to Inventory >> Devices >> Add Devices >> Use YAML
Use the below format to add devices to the application
Cumulus: cumulus
Arista: arista
Cisco: cisco-nxos
SONiC: sonic
Make sure to use the correct indentation for the YAML files
Click Save & Apply
ONES Application is now ready to manage the added devices
Click on Add Devices
Upload Device Inventory using CSV File
Navigate to Inventory >> Devices >> Add Devices >> Use CSV
Use the below format to add devices to the application
Select CVS file to Upload >> Choose CSV file containing devices entry >>
Upload & Add
Work with Aviz Sales/Support contact to create an account on Aviz Networks Support Portal
Click on the Downloads section, under ONES, click to download ONES Release 3.0
Untar the file
Navigate to aviz_ones
Copy ONES Release 3.0 package (qcow2) to KVM Hypervisor Server
If your host server has Ubuntu Desktop and virt-manager installed you can use it to deploy the VM. Make sure you can start the Virtual Machine Manager and that it connects successfully to the local hypervisor.
Creating a VM with virt-manager is very straightforward, Use the following steps to deploy the ONES-Application
File -> New Virtual Machine -> Import existing disk image -> Forward
Now the ONES Application is ready to use
Create an XML configuration file from the following template using vi
Start the VM
If you see a permission error run the virsh command with sudo may fix the issue
Check the VM status
Work with Aviz Sales/Support contact to create an account on Aviz Networks Support Portal
Click on the Downloads section, under ONES, and click to download ONES Release 3.0
ONES-3.0.0 OVA support below Versions of VMware Family
Login to ESXI >> Create / Register VM
Choose Deploy a virtual machine from an OVF or OVA file >> NEXT
Give it a Valid Name >> Click to select Files or drag/drop (upload from the download folder)
Choose the downloaded OVA package >> NEXT
Choose preferred storage to run ONES-3.0 VM >> NEXT
Choose a Network Adapter to provide DHCP IP to ONES App (Management interface/Eth0) >> NEXT
Verify all the inputs >> FINISH
ONES is Ready to use >> Power On the ONES VM
Once logging into the server CLI using below credentials, please continue with the next steps that is Ones Agent Installation
To access the ONES application, use Server IP/FQDN with HTTPS
https://<host-ip/FQDN>
Use default credentials to login, refer Installing ONES Application page for default credentials
Login to with your account credentials
Login to with your account credentials
VMware ESXI
ESXi 7.0 U2
ESXi 7.0
ESXi 6.5
VMware Fusion
Fusion 13.x
Fusion 12.2.x
Fusion 12.x
VMware Workstation
Workstation 17.x
Workstation 16.2.x
Workstation 16.x
In data centre operations, a rule engine with alerts for various metrics is essential for proactive monitoring and management of critical components and services. Let's see the different types of rule engine alerts for specific metrics in a data centre environment
CPU and Memory Alerts
Fan and Power Supply Unit Alerts
Traffic Bandwidth
ASIC IPv4 & IPv6 Routes
BGP Neighbour Alerts
Health Services
Device Down Alerts
SSD Health, temperature and memory usage alert
Device Queue counters
PFC counters
Traffic Errors and Discard Counters
frr and syncd services CPU utilization status
Server Agent based metrics
CPU Temperature and Utilization
Down status
FAN Speed
Memory Utilization
GPU
Memory Utilization
PSU Power Draw
Temperature
Utilization
Rule Engine pushes the configured rule notification in case any device breaches the threshold value configured under the rule to
Slack channel
Zendesk Support ticket
Service Now ticket
To use Rule Engine Alert feature User needs to setup first Slack channel integration, Zendesk Support integration or Service-Now integration
There are two types of Rule a user can configure
Entity Based
Allow a user to create Rules per device
Allow user to include or exclude the devices from the rule
Entity by Property
Allow a user to create Rules by using HwSKU, Role, OS Version across all the managed devices
Rule Name: The user can choose any related name
For: The user can choose 2 options
Metrics: Metrics depend on the above (For: Device/Interface) condition
Measure: Metrics are measured in three diff ways
MIN
AVG
MAX
Period: Measured metrics can be verified with a buffer of a timer
5 min
10 min
15 min
30 min
1 hour
When Measured Value is: This option allows a user to choose what condition has to match when the measured value is
EQ: Equal to
NEQ: Not Equal to
GE: Greater than Equal to
LE: Less than Equal to
GT: Greater than
LT: Less than
Critical Threshold: The user can set a Critical value on which push notification will be triggered
Warning Threshold: The user can set a Warning value on which push notification will be triggered
Notify: The user can choose the integrated SLACK Channel
Create Ticker: Zendesk Users can choose this to raise the Zendesk support ticket
Create Ticker: ServiceNow Users can choose this to raise the ServiceNow support ticket
Weekly Digest: Slack Users can choose this for Weekly Digest to SLACK Channel
Do not notify if the same alert trigger in: 30min, 1hour, 2hours, 10hours, 24hours
Stop notifying after: The user can choose a value of occurrence then it will not trigger the same in the next 24 hours
Rule Name: The user can choose any related name
Filter: user can filter the rule for all managed devices by
HWSKU
ROLE
OS Version
For: The user can choose 2 options
Select: this option depends on the Filter category, possible values are
Select HWSKU :
Select ROLE :
Select OS VERSION :
Metrics: Metrics depend on the above (For: Device/Interface) condition
Measure: Metrics are measured in three diff ways
MIN
AVG
MAX
Period: Measured metrics can be verified with a buffer of a timer
5 min
10 min
15 min
30 min
1 hour
When Measured Value is: This option allows a user to choose what condition has to match when the measured value is
EQ: Equal to
NEQ: Not Equal to
GE: Greater than Equal to
LE: Less than Equal to
GT: Greater than
LT: Less than
Critical Threshold: The user can set a Critical value on which push notification will be triggered
Warning Threshold: The user can set a Warning value on which push notification will be triggered
Notify: The user can choose the integrated SLACK Channel
Create Ticker: Zendesk Users can choose this to raise the Zendesk support ticket
Create Ticker: ServiceNow Users can choose this to raise the ServiceNow support ticket
Weekly Digest: Slack Users can choose this for Weekly Digest to SLACK Channel
Do not notify if the same alert trigger in 30min, 1hour, 2hours, 10hours, 24hours
Stop notifying after: The user can choose a value of occurrence then it will not trigger the same in the next 24 hours
Navigate to Configurations >> Configure Devices
Allows you to configure new devices
Supports valid YAML files
You can download the sample YAML file, edit it, and upload it again with the desired configuration
While configuring the topology, users can utilize the "Visualize YAML" feature to preview the structure and layout.
Navigate to Configurations >> Configure Devices >> <Upload Valid YAML> >> Visualize YAML
NOTE: Follow the ONES Orchestration to know more about configuration and all the possible use cases.
This section explains how users can add/manage/remove the devices using ONES.
Navigate to Inventory >> Devices
Using this tab, user can:
Onboard the non-sonic(Agent-Less) device to the application using Add devices
Syslogs capture
Upgrade the device using Custom Upgrade
Upgrade the device using ZTP (Zero Touch Provisioning)
Reboot individual devices or multiple devices by selecting them in one click
Remove the devices
Tech-Support download
Complete Inventory can be downloaded in CSV format
The user needs to make sure, The devices have a unique name, otherwise, there will issue while plotting the full topology view(Topology Page).
Agent-based devices auto-discover the ONES-App and get registered automatically on the ONES Inventory page
To Onboard the Agent-Less devices user needs to add them manually
Navigate to Inventory
This page gives the control to onboard the devices with two options
Add Devices using the YAML Editor
Upload the CSV file containing the device list
The movement user chooses CSV upload, then the YAML Editor will be disabled
Click on Add Devices
Upload Device Inventory using YAML Editor
Navigate to Inventory >> Devices >> Add Devices >> Use YAML
Use the below format to add devices to the application
Cumulus: cumulus
Arista: arista
Cisco: cisco-nxos
SONiC: sonic
Server: server
Make sure to use the correct indentation for the YAML files
Click Save & Apply
ONES Application is now ready to manage the added devices
Click on Add Devices
Upload Device Inventory using CSV File
Navigate to Inventory >> Devices >> Add Devices >> Use CSV
Use the below format to add devices to the application
Select CVS file to Upload >> Choose CSV file containing devices entry >>
Upload & Add
Role / Region: Shows the device roles and regions
SKU / ASIC: Shows the device hardware SKU and ASIC vendor
Port / Max Speed
Shows the number of ports per device and max port speed on the device
Click on the number of ports to get a detailed view of all the ports on a particular device
PSUs / Fans: Shows the total number of Power supplies and Fans present on a particular device
NOS Image: Shows the details of the network operating system running on the device and when it was last updated
ONIE Version: Shows which ONIE version is running on the device and when the last reboot time of the device
Agent Version / Network OS: Shows the agent version running on the device and the current active OS version on the device
Agent status / Last contact: Latest status of the Agent and when it was last communicated with that Agent.
Connect: Using this feature we directly get the CLI access of the device
SSH Connect
Console Connect
Details: This last option we can again use to get the details of the device
Navigate to Inventory >> Devices >> Remove Devices
Choose the devices to be removed & confirm
once the user clicks on confirm, the Inventory page will remove the device
If the devices are agent-based they will get added again after some time, if the user wants to remove the agent-based devices, then the user needs to uninstall the agent from the device
Now the selected devices have been removed from the ONES application
This feature gives the control to upgrade the device to the new version
An HTTP image link is required to use the custom upgrade
Select any of the devices to upgrade to the new version
Click on Custom Upgrade
put the new Image URL and then Submit
It will show the status as In Progress
Using this page a user can directly upgrade the box
Select any of the devices to upgrade via ZTP
Click on Upgrade via ZTP
Click on Yes
Here we will see how we can reboot a device using
We have the option to choose one or multiple devices at a time to reboot
Choose one of the devices that we want to reboot
Click on Reboot
Click on Yes
Syslog
Tech Support
Export Inventory
1. Create a Channel for ONES-App push notification
Create an App
Choose From scratch
Provide any App Name and choose the workspace where the user wants to get the push notification & Create App
Choose Incoming Webhook and Activate Incoming Webhooks & Add New Worbhook to workspace
Select the configured Channel & Allow
Copy the newly created webhook link
Open ONES-App and select Integration >> Messaging
Add Channel & Paste the Webhook URL
After saving it will be available to use while creating any rule using Rule Engine feature
Overview
The monitor widget in ONES:
Shows the complete topology view of the fabric
User can change the topology view by:
Datacenter
AI-ML
The Topology view can be categorised by:
Region
Platform
ASICs
Statuses
Not streaming, Faulty Fans, Faulty PSUs, Links Down
Metrics
Bandwidth RX & TX
Memory, CPU & ASIC Utilization
This page shows all the links and information connected to each other
Using a right click on device it allow a user to go to specific feature details
Traffic, Health, Capacity, Protocols
Allow a user to connect device via SSH, Console Access
Can get SYSLOGS
Traffic View
PFC Enabled Device view
input/output packets in million per sec
Errors and Discard packets per interface
And more related metrics
AI-Fabric metrics
Health of the devices
CPU & Memory Utilization
CPU & PSU Temperature
PSU Voltage & Fan Speed
SSD Temperature, Health and Memory utilization
Server GPU Metrics
Capacity of the devices
IPv6 & IPv6 Routes
ASIC/Software/Kernel
ASIC ACL capacity
Links Page
All the connected devices
Transceivers info
Protocols status
BGP status
VXLAN
MCLAG
LACP
QOS
Navigate to Monitor >> Topology
Topology Type: DATACENTER
Topology Type: AI-ML
This shows the complete Topology view, how the devices are connected
Topology can be filtered by Underlay/Overlay/RoCE
We can put the filters to check the customized view of the Topology
Count of devices
All devices onboarded
Not streaming
Faulty Fans & PSUs
Links Down
We can also check Down Links to check the topology those are having links in the shutdown state
When we hover the cursor over any device and use right click we get few more controls
Device details/Ports
Direct Navigation per device
Traffic/Health/Capacity/Protocols
Console connect
SYSLOG
we can also filter the view by using
Statuses
Not streaming, Faulty Fans, Faulty PSUs, Links Down
Metrics
Bandwidth RX & TX
Memory, CPU & ASIC Utilization
Using this widget we can check the input and output errors across all the devices
This widget also shows the input and output packet per device
Navigate to Monitor >> Traffic
This page shows the information:
Device Name & IP
Roles & Region
Device details
Interface speed and ports
Errors & Utilization of the links
Filter Ribbon can be used to get a customized view
PFC Enabled Devices
Operator up/down
Admin down
PFC Enable interfaces
When we click on any device it gives more information about the interface traffic
Errors per interfaces
Bandwidth Utilisation per interfaces
When we click on any particular interface it gives the timescale of the inputs and output packets with Errors and Discards & all metrics in detail
This page shows the traffic drop rate per interface based and will be very useful while doing any troubleshooting for a traffic drop
Using these details a user can check more details inside to fix the issue of dropping/discarding packets
This page shows the latest utilization of all the devices
CPU & Memory utilization
Temperature & Voltage of PSU
fan speed in % & RPM
SSD Temperature, Health and Memory Usage
Navigate to Monitor >> Health
Health Status is reported for the following components
Roles
SKU/ASIC
Ports/Max Speed
CPU Utilization (%)
Memory Utilization (%)
CPU Temperature (℃)
PSU Temperature (℃)
PSU Voltage (V)
Fan Speed (RPM)
SSD
Temperature(℃)
Health(%)
Memory(%)
HOST / IP
Device Name
Device IP
Roles/Region
Device Role
Device Region
SKU/ASIC
SKU (Stock Keeping Unit)
ASIC
Port/Max Speed
Total number of ports available
Speed of ports
CPU Utilization (%)
CPU Utilization reported in 4 states
Normal
Acceptable
Critical - Action needed
Not Streaming - Agent is not up
Click on any device to get the view/status of all the components related to that device
Memory Utilization (%)
Memory Utilization reported in 4 states
Normal
Acceptable
Critical - Action needed
Not Streaming - Agent is not up
Click on any device to get the view/status of all the components related to that device
Average CPU Temperature (C)
Details of the temperature of the CPU across all the devices in degrees celsius
Any device that breaches the configured acceptable or critical value will be shown here
Click on any device to get the view/status of all the components related to that device
Average PSU Temperature (C)
Power Supply Temperature in degrees celsius
Any device that breaches the configured acceptable or critical value will be shown here
Click on any device to get the view/status of all the components related to that device
PSU (Voltage)
Power Supply Voltage readings in volts
Any device that breaches the configured acceptable or critical value will be shown here
Click on any device to get the view/status of all the components related to that device
Average Fan Speed (%)
Fan Speed in % of maximum supported RPM
Any device that breaches the configured acceptable or critical value will be shown here
Click on any device to get the view/status of all the components related to that device
SSD
SSD Status will be shown here
SSD Temperature: will allow a user to track the temperature
SSD Health: will allow a user to check the health utilization in percentage
SSF Memory: this metric will be useful to check the utilization of SSD
We can check the health of the device as per some customization
We can filter the devices by:
Roles
Region
We can choose a role using the available Role-based option
4 Roles available
Super Spine
Spine
Leaf
ToR
Let’s check it with a Leaf filter
After selecting Leaf input, here is the new view of only devices that belong to the Leaf role
This Platform Widget also gives the option to check the extended capability view of the device
Apart from this monitoring view, we can also verify/check extended feature sets like:
PSU Current (A)
PSU Power (W)
Services Running
Services CPU/Memory Consumption (%)
To view per device status with all possible widgets, click on any of the devices present on the list
When we choose a specific device we get an output like this
1
Time Frame: Check Utilization Trends based on Time Range The application has the capacity to store up to 2 weeks of data
2
Refresh Component Status
3
Alerts: show all the alerts triggered by rules
4
Raise a Ticket for Technical Support
5
Documentation
6
API Explorer
7
Device Details
Platform
Number of Ports and Speed
Agent Version
Uptime
CPU Utilization
Memory Utilization
CPU Temperature
Services running on the device
Here we get the complete status of CPU usage with a time range A complete status What was the usage from starting to end
To check a specific time detail we can hover the cursor to any level
Now here we get the status of Memory Usage of selected device
To check a specific time detail with memory utilization, we can hover the cursor to any level
This widget shows us the CPU consumption percentage level of all services / per service.
Here we can see we have the option to check the consumption view of CPU
To check a specific time detail we can hover the cursor on any level
This widget shows us the Memory consumption percentage level of all services / per service.
Here we can also check the consumption view of only Memory.
To check a specific time detail we can hover the cursor on any level
The best widget here for Services
We can also check the total number of count of services running on the platform
This graph shows the red colour bar, red colour show at what time one of the services went down
This template shows the status of CPU temperature in degree celsius
Here we get the status of all the CPU and Core running on the device
This page outlines the key metrics for accurately monitoring the performance of various components, specifically focusing on Temperature, Current, Fans, and Power.
This page allows users to monitor the resource usage of their devices, providing a proactive view of how resources are allocated per switch and the trends in service usage. If you notice any resource usage spiking, you can easily navigate to another utilization page to identify the specific processes contributing to the increased demand. This feature is designed to help users manage their resources more effectively and prevent potential issues before they impact performance.
These metrics can be effectively analyzed through a time series graph.
This page shows the view of Capacity and a few more details related to devices
This widget Shows
Roles/Region per device
SKU and ASIC details per device
ASIC ACL Capacity utilization
IPv4 Routes (ASIC, Software, Kernel)
IPv6 Routes (ASIC, Software, Kerneel)
Roles/Region per device
SKU and ASIC details per device
ASIC ACL Capacity utilization
IPv4 Routes (ASIC, Software, Kernel)
IPv6 Routes (ASIC, Software, Kernel)
This Capacity Widget give us the control to get the output per Role and Region basis also
Let's choose Leaf Role to get the customized view
In the same way, we can customize the view by Region & SKUs
This widget gives us the capability to check the extended view of the Routes & ACL usage with a range of time
Click on any of the devices to get the extended view
Feature
Use
When we move the cursor to metrics this gives the usage view of ipv4:
ASIC
Kernel
Software
When we move the cursor to metrics this gives the usage view of ipv6:
ASIC
Kernel
Software
When we move the cursor to metrics this gives the usage view of ACL:
ASIC
This page gives a view to the user for all the possible connected links between devices with a few more capabilities
Navigate to Monitor >> Links
Hostname
Hostname of the managed device
Role
Role of the device
Port/Interface
Interface details
Port Speed
Link speed of connected devices
Transceiver
SFP/QSFP Optics statuts
Manufacturer
Device Manufacturer
Manufactured Date
Date of Manufacturing
Admin and Operator status
Local and Remote status of link
This page helps a user to get the best view of the number of connections between devices with speed and other manufacturer details
This page gives the exact view of the interface name, interface speed, transceivers and admin & operator status
User can also check the transceiver details with timescale database
This Protocol Page shares the metrics of below features
BGP (numbered/ unnumbered)
VXLAN
MGLAG
LACP
QOS
This BGP section will help a user to know more and accurate number with following details
BGP 2 byte and 4 byte AS
BGP numbered
BGP unnumbered
Total number of neighbors configured
How many neighbors are up and down
total number of prefixes and how many we are advertising
BGP neighbor details
Here we can get:
Device name
Device IP
Here We get the view of:
Roles and Region
this column shares the details of:
SKU
ASIC
This shared the count of total BGP neighbours
This column share the status of
how many BGP neighbours are UP and running
How many BGP neighbours are in Down state
This column shares the Total Prefixes Present in BGP
These are the total number of advertised prefixes by the router to other BGP neighbours
This is the Local BGP AS number
Here we have the control to check more details on neighbours
This shows the status of the neighbour's details, the total number of neighbours, received routes, neighbour RID, BGP AS number & much moreWe have the option here to check the neighbour details and status of RoutesWe can click on neighbours to get more details about all neighbours connected
Feature
Feature
Here we get the details of connected neighbours
Neighbour Device Name
Neighbour IP
Here we get the Neighbour BGP AS number
This shares the neighbour status of uptime, from how long the neighbour is connected
Here we get the detail of the last neighbour reset timer
This share the count of established and dropped connections per neighbour
Here is the view of the Keep Alive timer:
Tx: how many keepalives have been transmitted
RX: how many keepalives have been received
Here is the view of the Route Refresh messages count:
Tx: how many Route-Refresh messages have been transmitted
RX: how many Route-Refresh messages have been received
Here is the view of the Updates Count:
Tx: how many times updates have been transmitted
RX: how many times updates have been received
The user can get per-device status by choosing a particular Device
Click on the device name to get the status
This new page shows the status of BGP neighbours about UP and Down status
On the right side it shows the BGP announcements and the local prefixes present in the BGP table
This section is really helpful in finding the devices with active VXLAN features enabled with a few more details
L2 & L3 VXLAN metrics
Local VTEP
Remote VTEP and details on how many are up and down
VLAN to VNI Mapping
VRF to VNI mapping
Below output shows all the possible devices with VXLAN details
By clicking on Local VTEP ID we get the most accurate details on that device for all the remote VTEP connected to this
Users can click on operation status to get the time series graph about the up nd down status, at what time the VTEP was up or down
Users can get the details on VLAN to VNI mapping using the option on this page
Using the below option user can check all the possible VRF to VNI Mapping
This feature enhances network management by allowing users to access a timescale graph. This graph shows the status of neighbour and peer links over time, indicating periods when they were down or active. Users can further examine the health page to determine whether downtime resulted from process issues or resource utilization problems.
Additionally, the feature provides tools for verifying:
MGLAG (Multi-Chassis Link Aggregation Group) Domain ID
The status of PortChannels associated with Peer Links
MCLAG-L3 (Multi-Chassis Link Aggregation Level 3) status
MCLAG-L2 (Multi-Chassis Link Aggregation Level 2) status
With these capabilities, users can effectively track and diagnose network performance and configuration issues.
Navigate to Monitor >> Protocols >> <Choose Devcice> >> Click on Active/Passive
Using this Page metrics user will get the details on the EtherChannel status, it will show per device etherchannel status with member ports and the status with a time series graph
Selecting a port channel on any device allows users to view its time series data. This feature enables the analysis of the device's status, ranging from the latest hour up to two weeks of metrics.
Quality of Service (QoS) in networking is used to prioritize certain types of traffic, ensuring critical applications like video conferencing, VoIP, or online gaming get sufficient bandwidth. It helps reduce latency, minimize jitter, and maintain consistent performance, especially in congested networks, by managing and controlling traffic flows based on importance.
This page allow a user to check the active configuration related to:
QoS Active Queues on interfaces
802.1p and DSCP to TC mapping
PFC enabled queues
TC to PG and Queue mapping
Scheduler and WRED status
To verify the activated queues, the user can click on "Show Active" This page displays the status of each queue along with the features enabled for each interface.
The "Show All" option allows the user to view all activated configurations per widget, providing a detailed overview of QoS-related settings for each device.
User can onboard all the devices on the application and can get a complete view of all the populated tables
Agent-Based (Switch and Server) devices will automatically added using the auto discovery feature
Agent-Less devices needs to be added using this inventory page
The Inventory tab has the below-mentioned features:
Custom OS upgrade: Upgrade the device OS with any customised image. You need to provide the correct path to ensure the OS is updated successfully
OS upgrade via ZTP: Upgrade the device OS via Zero-touch provisioning
Reboot devices: Reboot the device from a single click in the UI
Remove devices from the dashboard: The user can remove the auto-discovered(Agent based) and non-sonic devices
Add devices from the dashboard:
The User can onboard the non-sonic devices using the YAML file upload or via the in-built editor in the UI
User will have the option to upload CSV file with the list of devices
Sub-options
Syslog: Using this tab user can easily access all the syslogs and can find out the more relative logs directly in case of some failure of any process or any other module of device
Tech Support: User can directly download the techsupport to debug the issue.
Export Inventory: A network engg can easily get the inventory in CSV file
Navigation >> Watcher >> Rules
Create New & Add the required inputs
Preview & Create
Once a user creates the rule it will be available in the rule list
Once the device CPU Utilization goes above the threshold value it will start pushing notifications to SLACK & Zendesk Support tickets and the ONES App Alert Page
Integrations >> Ticketing >> ServiceNow
Add Channel
Inputs to sucessfully integrate ServiceNow
Instance URL (from serviceNow developer Account)
Credentials (from serviceNow developer Account)
Login to the Zendesk Support Admin panel & Follow the steps
click >> Apps & Integration
Choose >> Zendesk API
Enable Token Access
Give API Token Description (Optional)
Copy the API Token
Save the Settings
Open ONES-App and select Integration >> Ticketing
Add Channel & Paste the required details
After saving it will be available to use while creating any rule using Rule Engine feature
Navigation >> Watcher >> Rules
Create New & Add the required inputs
Preview & Create
Once a user create the rule it will be available in the rule list
Once the device SSD Memory Utilization goes above the threshold value it will start pushing notifications to SLACK & Zendesk Support tickets & also inside the ONES Alert Page
login to & choose Your apps
ONES-2.1.0 Application allow users to add third-party tools to get the desired Alerts and metrics in raw format
Wherever there is a breach threshold value in ONES we can get the same notification in 2 different application
Slack Channel Integration: Push notification of all the alerts in the form of a message
Zendesk Support Integration: A Zendesk email alert with payload of triggered value
ONES2.1 allows the use of DataLake, users can integrate 2 different platforms and ONES provides the capability to store the RAW data of all the Metrics in the Cloud and then the user will be able to use that RAW data for any deployment or any other use cases.
Splunk
Amazon S3
ONES2.1 allows the use of DataLake, users can integrate 2 different platforms and ONES provides the capability to store the RAW data of all the Metrics in the Cloud and then the user will be able to use that RAW data for any deployment or any other use cases.
Splunk
Amazon S3
ONES DL provides a flexible and scalable platform for storing, managing, and analyzing diverse data types at scale. By leveraging a schema-on-read approach and supporting various analytics tools, ONES DL facilitate advanced data analytics and enables organizations to derive valuable insights from their data assets. However, proper governance, security, and metadata management are crucial to ensure the usability, reliability, and integrity of data lakes.
When a user creates a rule, and the threshold value is exceeded, alerts will be generated. These alerts will also be displayed on this page.
Notifications from the Alerts Page are always sent to:
ServiceNow Support: Integrated ServiceNow Support will get all the alerts triggered by ONES.
Zendesk Support: Integrated Zendesk Support systems will receive all push notifications.
SLACK Channel: If integrated, notifications will also be sent to the configured SLACK channel.
Alert Page: It will always display the alerts on ONES Alert page
At a time one single Support Ticket can work, While using Zendesk Support User wont be able to use ServiceNow Ticket Support
Alert Management
Count of alerts related to feature
Alert Name
First seen of the alert
Last seen of the alert
Option to delete the alerts
Expand Option is used to check the payload and total alerts
Time Scale Alert Updates
Users can choose the time range to check the more alerts
Alert Page allows a user to download the report in CSV format with a time range
Using this feature setting we can set the acceptable and critical percentage level for the following device components
This page gives control over the widget refresh timer and user idle state
Users can set the manual timer to refresh all the widgets after a time interval (default is 120sec)
Users can set the timer after how many minutes of idle state the ONES-UI should be logout
CPU Utilization(%)
Memory Utilization(%)
CPU Temperature(℃)
PSU Temperature(℃)
PSU Voltage(V)
Fan Speed(%)
SSD Health(%)
SSD Temperature(℃)
SSD Used Memory(%)
As per requirement, we can set different lower and higher threshold values for each component and Users can see the acceptable and critical number of devices in the Monitor Health tab view
Navigate to Monitor >> Platform when any component breaches the higher value
Navigate to Settings >> Thresholds
Change the values as per your requirements
Update it to get these new settings live, after Save Changes
, all these metrics will be reflected on devices metric pages under Inventory
Using this tab user can control the timer of all the widgets refresh and idle timers of the application
Navigate to Settings >> Application
Using this page, users can change the refresh interval in seconds for all the widget
In the dropdown menu, available intervals are:
30 Seconds
60 Seconds
90 Seconds
120 Seconds
Users can set the timeout in minutes(2-60) for the ONES-UI, by default the timeout is disabled
Login to the Zendesk Support Admin panel & Follow the steps
click >> Apps & Integration
Choose >> Zendesk API
Enable Token Access
Give API Token Description (Optional)
Copy the API Token
Save the Settings
Open ONES-App and select Integration >> Ticketing
Add Channel & Paste the required details
After saving it will be available to use while creating any rule using Rule Engine feature
1. Create a Channel for ONES-App push notification
Create an App
Choose From scratch
Provide any App Name and choose the workspace where the user wants to get the push notification & Create App
Choose Incoming Webhook and Activate Incoming Webhooks & Add New Worbhook to workspace
Select the configured Channel & Allow
Copy the newly created webhook link
Open ONES-App and select Integration >> Messaging
Add Channel & Paste the Webhook URL
After saving it will be available to use while creating any rule using Rule Engine feature
The dashboard provides the NetOps with an overview of the data centre. It contains the entire hardware inventory of the network and shows the status whether these switches are streaming or not streaming.
Devices
Status of Switch
Not Streaming: The device is Inactive/Unreachable
Streaming: The device is in a Working state
Non-Licensed: The device is added out of licensed devices
Regions
Status of Regions, their Location and Device Mappings
Switch SKUs
Switch Hardware Vendor, Model Number and SKU
ASICs
ASIC Vendor, Model and Hardware version details
Roles
Device Roles in Customer Environment
Super-Spine
Spine
Leaf
ToR
After the Installation of ONES Application for the first time, the Dashboard is empty and Devices need to be onboarded for them to reflect
Dashboard will be used to
monitor the status of an agent running on all the devices present
Device Roles and associated Regions and
Details of Switch Hardware SKU and ASICs
Navigate to Dashboard >> Components
PSUs
Shows the list of
all faulty Power Supplies across managed switches
LED status of managed switches
Fans
Show the list of
all faulty fans across managed switches
airflow direction of faulty fans for troubleshooting
Transceivers Temperature
Temperature readings and alerts for Optics
Transceivers Voltage
Voltage readings and alerts for Optics
Navigate to Dashboard >> Interfaces
Interfaces
Total number of ports available across devices
Status of Up interfaces across devices
Unused interfaces across the devices
Cabling
Total number of cables used across devices
Cable type used across device
Fiber
Copper
Count of cables required for unused ports
Interface Down
Information on Down Interfaces
Agent Status of a Device for a Down
Interface Historical Flaps starting with 5, 15, 30 minutes and 1 hours interval
Provides two types of Status:
Device Name with interface details
At what time the interface went down
Using this page a user gets the status of
the cables utilized in the network
how many pairs of cable can be used for future topology (helps the admins in capacity planning)
interface to identify if any power failure is happening over the interface
Navigate to Dashboard >> Software
Telemetry Agent version
version across all managed switches
distribution of Agent-based vs Agent-less switches
Device Status (Up/Down) based on Distro
Orchestrator Agent version
version across all managed switches
Agent Health - Up and Down
Network OS
NOS status and version across all managed switches
Distribution based on NOS versions
Device Status (Up/Down) based on NOS versions
Firmware Version
This widget shows the BIOS & ONIE version running on all managed devices
Linux Distro
version across all managed switches
Distribution based on Linux Distros
Devices Status (Up/Down) based on Linux Distro
Integrations >> Ticketing >> ServiceNow
Add Channel
Inputs to sucessfully integrate ServiceNow
Instance URL (from serviceNow developer Account)
Credentials (from serviceNow developer Account)
Steps to onboard splunk
Navigate to Cloud Service >> Integrations >> Splunk
The user needs to add all these mandate details of the Splunk Instance
Nickname //can be any
Endpoint URL
Token
Index
After saving the details it will be available to push the metrics to the Splunk Instance
Using this option user will have control on metrics and the frequency of streaming the metrics to Splunk Instance
Users can modify the metrics as per interest and can set a frequency from 1 minute to 60 minutes
Navigate to Cloud Service >> Integrations >> Amazone S3
The user needs to add all these mandate details of the Amazone S3 instance
Nickname
ARN Role
Region
Bucket Name
External ID
After saving the details it will be available to push the metrics to the Amazon S3 instance
Using this option user will have control over metrics and the frequency of streaming the metrics to the Amazon S3 Instance
Users can modify the metrics as per interest and can set a frequency from 1 minute to 60 minutes
This section explains how large data centers can be designed seamlessly using ONES.
Most fabric orchestration solutions available today are complex and often difficult to understand. ONES provides simple and effective tools, such as predefined templates (YAML file), to configure data centers at scale. ONES allows a customized way of configuring devices that includes enhancements to the standard configuration.
Save Config:
Copy to File:
Restore Config: //If needed only
You need to provide the following inputs to configure the devices:
This section explains the device configuration procedures, that gets applied seamlessly across the entire fabric
Navigate to Configurations >> Devices
Click on the Configure Devices button in the top right corner. This opens a new screen with a sample device configuration. You can edit the configurations directly in the UI to make the desired changes. You can also download the sample YAML file by clicking on Download YAML button, provided at the bottom of the above screen, make new changes and upload the YAML file using Upload YAML button.
Inventory: Specify how many devices you want to add in any particular role - Super Spine, Spine, Leaf, and ToR.
Connectivity: Specify the parameters required to establish link connectivity such as:
Device switch ID: unique ID for every device, required to correctly render the topology
Switch name: hostname of the device
IP address: management address of the device
Autonomous System Number (ASN): BGP AS number to use
Device Credentials: Username & Password.
MCLAG: Multichassis Link Aggregation
Links: links connected with another device
BGP: Specify if you want to enable regular BGP peering or BGP unnumbered peering.
PhysicalIfCfg: Enable or disable FEC and change the MTU settings on all the links being configured.
ASN: Assign a BGP ASN (Autonomous System Number) from the specified pool. Dynamic assigning ASN will be implemented in ONES release 2.0. For release 1.0, you need to specify the ASN under the device configuration, as shown in the sample YAML file.
IPv4Pool: Assign IP pools to different subnets. ONES automatically divides the subnets according to the number of available links.
ONES application uses IPv4 subnets for:
Interfaces
Loopbacks
Host interfaces
To configure BGP neighborship
ONES automatically advertises these subnets in BGP
IPv6Pool: Assign IPv6 subnet.
ONES application uses:
IPv6 subnet for interfaces
Subnets to configure BGP neighborship
Automatically advertise these subnets in BGP
NTP: Provide the NTP server address, to enable NTP. You can choose your desired timezone.
SYSLOG: Provide the SYSLOG server address, to enable SYSLOG.
SNMP: Provide the SNMP server address, to enable SNMP.
Parameters: Provide user the ability to enable VXLAN and related Parameters.
QoS: Provide user the ability to enable RoCE related configuration.
Navigate to Configurations >> Devices >> Configure Devices
Click on Apply Configs button in the bottom right corner of the above screen, to push the configs across the entire fabric.
ONES provides real-time updates when the devices are being configured and validates the configurations automatically to ensure the network is ready to use.
You will see the below screen, after the configurations are successfully verified:
With Few versions of FRR, when user gives reboot, existing config is erased and default config is pushed to FRR. This will lead to missing the configuration for users. below is the workaround config in this situation
Go inside BGP Container & Open docker_init.sh
Look for "Split" Keyword & Comment out the statemets inside that with if conditions
Add a new line which writes Service integrated in vtysh.conf
Comment rm -f /etc/frr/frr.conf & save the changes
Commit the docker Changes
Configuration Page allows a user to connect to the device using console access or using SSH method too.
Navigate >> Configurations >> Devices >> Connect
User can choose SSH or console option to access the device.
Console Logs show the exact configuration loaded on the device and the overview of the config loaded as per YAML & Derived Host IP Range can be viewed from the "Derived_Config:" section
Navigate >> Configurations >> Devices >> Console Logs
Click on the Console Log of the device
This section describes the configuration feature of ONES using which you can check the applied and running configurations.
Navigate >> Configurations >> Devices >> Config
Click on Config
This section describes the compare configuration feature of ONES using which you can compare the applied and running configurations.
Navigate >> Configurations >> Devices >> Compare Configs
Compare Config
This page allows the user to compare applied configurations to the running configuration of a selected device.
The user should select only one device from the list and perform 'Compare Config' on this page, it may take several minutes to fetch the running configuration from the device
YAML Editor will appear with two windows comparing applied and running configurations as depicted in the below picture
The difference in configuration is highlighted with colour coding in the respective window of the configuration
This Section describe the use of Backup & Restore Configuration across all the managed devices
Navigate >> Configurations >> Devices >> Backup & Restore Configs
Using this feature, a user will be able to take multiple backups and will be able to restore the backup at any time.
Select Backup & Restore Configs
Give a Tag Name to the device & Select the device, the device you want to take a backup config
Submit the task
Once submitted the config will backup with the given name & can be used to restore the config at any time in future
Select Backup & Restore Configs
Select Restore Config
Click on the drop-down button of the device on which the user wants to restore the config
Select the backup a user wants to restore from the Backups Available list
Then Submit the backup & Click on Yes to confirm
Use this feature to
Create new users and roles
Remove and suspend existing user one by one
Remove and suspend multiple users at the same time
Password reset of existing users
Only super admin or Enterprise admin can perform these actions
Navigate to Accounts >> Users
Initially, we get one default Admin User Credential
In this view, we get the User status and Role given to it and the Last login time by the user.
On this User tab, we can Add new users and can remove or suspend any existing users
Navigate to Accounts >> Users >> Add
Admin can add
Profile picture
Username
Password
Password should contain:-
Minimum Password Length - 8 characters
Maximum Password Length - 24 characters
Character Support - Alpha Numeric
Special Characters - (# @ $ ! & % only)
Character Rule - At least one Upper Case and one special character
First and Last Name
Role
Enter all the required details
Click Save
"on the first login by the user with given details, ONES application prompt with a screen to rest the password
Now User is ready to login with valid credentials
Navigate to Accounts >> Users >> Edit User >> Reset Password >> Yes
Using Admin credentials you can reset the user password
Click on Reset Password
Submit
& Save
After this user can try login with the temporary password, on the first login, ONES application actively asks to change the password
Navigate to Accounts >> Users >> ((Select Users you want to remove)) >> remove >> Yes
We can remove multiple users at a time
After this, the user will be removed from the database and cannot use credentials to login again
Instead of removing any user, we can also suspend the user
Once we suspend a user it will not be removed from the database but it will be in an Inactive state in the database
Later if we need we can restore the user to its active state
Navigate to Accounts >> Users >> ((Select Users you want to suspend)) >> Suspend >> Yes
Choose users & click on Suspend
Navigate to Accounts >> Users >> ((Select Users you want to restore)) >> Restore >> Yes
Navigate to Account >> Roles
By default, the device comes with these 4 Roles
Super Admin
Enterprise Admin
Enterprise Staff
Vendor Staff
As per the new user access requirement, we can assign limited access to user roles
Navigate to Accounts >> Roles
Now let's add few extra permissions
Add/Remove Devices
Configure Devices
login to & choose Your apps
Configure Devices
Allows to make the config changes directly in the in-built UI editor
Download YAML
Download the sample YAML file and make the required changes
Upload YAML
Upload the modified YAML file
Device configuration is not yet started
In case of the user has not applied any feature like (VXLAN, MCLAG and so on..)
Device configuration is in progress
The device configuration has failed. Move the cursor to the failed option to see the Error message.
Device configuration is completed successfully
Host Logs shows the exact configuration loaded on the device and the overview of the config loaded as per YAML
is used to configure or modify a configuration when we need to add several other devices with the same file
Backup & Restore Configs: using this user can take device config backup and can use later to restore the configuration
This option is used to connect to device via SSH
here user can get the console logs of the particular device
using this option user can get the details on the applied config and the intended config of the device
Compare the applied config vs running config on the selected device and highlight the diff changes, if any